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A mi hermano 



Preface 



This dissertation reports the research I have carried out as a PhD student in the 
Statistical Field Theory Group of the UCM, where I have been privileged to work 
and learn under the guidance of Luis Antonio Fernandez and Victor Martin. They 
not only form a truly remarkable scientific partnership, but have also been the best 
advisors I could ever have hoped to have. 

My work during this time constitutes an attempt to make some headway in the 
field of complex condensed matter systems, concentrating on disordered spin mod- 
els and taking a Monte Carlo approach. In particular, this dissertation deals with 
two archetypical systems: the diluted antiferromagnet in a field and the Edwards- 
Anderson spin glass. The former is studied with Tethered Monte Carlo, a formal- 
ism developed during this thesis. 

As to the Edwards-Anderson spin glass, I have been fortunate to have access 
to Janus, a special-purpose machine that outperforms conventional computing ar- 
chitectures by several orders of magnitude in the Monte Carlo simulation of spin 
systems. This would be akin to being one of a few particle physicists with access 
to a newer, vastly more powerful collider {si parva licet componere magnis) and has 
allowed our group to tackle head-on a much studied model and still see some new 
physics. From the point of view of a PhD student, it has been a unique learning op- 
portunity. Janus is the fruit of a collaboration of physicists and engineers from five 
universities in Spain and Italy. The project is directed by Alfonso Tarancon and has 
as scientific coordinators Victor Martin Mayor, Giorgio Parisi and Juan Jesus Ruiz 
Lorenzo. My own participation has been, of course, only as a very junior member 
of a large collaboration, so I only include in this dissertation those physical studies 
where I carried out a major fraction of the work. 
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CHAPTER I 



General introduction 



Modern physics is steadily broadening its scope and tackling increasingly complex 
systems, whose rich collective behaviour is not easily explained from the often 
simple nature of their constituent parts. Thus, a lot of attention is being focused 
on understanding, from a fundamental point of view, an extremely diverse class 
of problems, ranging from vortex glasses in high-temperature superconductors to 
biological macromolecules. The featured physical phenomena can be as exotic as 
the colossal magnetoresistance of some manganites liPAGOii |COeo9} iLEV02i , or as 
familiar as the formation of glass | iANG93} |DEB97f ideboiI . The latter constitutes 
a particularly conspicuous example of an everyday material whose microscopic 
description remains, in the words of P. W. Anderson jAND93| , 'probably the deepest 
and most interesting unsolved problem in solid state theory'. On a different vein, 
the study complex physical systems has deep relations to the field of computational 
complexity and NP-incompleteness ||mezo2[|zeco6I| . 

The enormous variety of problems, often straddling the boundaries between 
physics, chemistry and biology, seems to suggest that the label of 'complex system' 
bears little meaning, since it seems that each class of systems must surely be studied 
separately. Actually, behind the diversity we can find key unifying features, which 
has motivated attempts to find some solid common ground for a joint treatment of 
complexity. 

In this sense, the best hope of the fundamental physicist is the notion of univer- 
sality ]cAR96t[AMi03| , [zj03i l. In general, a strict microscopic description of a complex 
system is a daunting task: one has to account for many degrees of freedom, whose 
interactions follow complicated laws. Fortunately, one can often identify a few 
crucial scaling variables, whose evolution encodes the behaviour of the whole sys- 
tem. More than that, by expressing all the more complicated quantities in terms of 
these, very different systems can be shown to have the same qualitative behaviour. 
Hence, all these systems can be understood through the study of their simplest 
representative. 

Perhaps the most striking example of this universal behaviour is the celebrated 
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Figure 1.1: Angell plot, taken from IjANGgJJ, showing the viscosity of many glass-forming 
liquids. The horizontal axis is rescaled in terms of a glass temperature Tg, defined as 
that where the system's viscosity reaches the value of 10^"' P. In this representation, the 
deviation from the ideal Arrhenius law (leftmost straight line) seems completely encoded 
by the derivative at Tg. Notice that the values of the viscosity span 15 orders of magnitude. 
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Configuration 



Figure 1.2: Idealised picture of a complex system. The free-energy profile contains many 
local minima, which define metastable states with exponentially long escape times. 
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Angell plot, of which we show an example in Figure In it, glass-forming 

liquids with completely different compositions and qualitatively different temper- 
ature dependencies of the viscous flow are classified according to their 'fragility'. 
This is defined as the (logarithmic) derivative of the viscosity at the glass transition 
temperature and, as it turns out, it characterises the material's deviation from the 
Arrhenius law, along 15 orders of magnitude. Scant few physical quantities have 
ever been measured along so wide a range but, beyond that, the figure encloses 
very deep physics. 

For instance, fluctuation-dissipation relations jKUB37] allow us to translate vis- 
cosity into time. Therefore, the plot is showing us a situation where relaxation 
times cross over from a microscopic to a macroscopic range. Notice that the scaling 
temperature in this plot is chosen as the point where the viscosity reaches 10^^ P 
(equivalent to relaxation times of one hour). More than that, and one has to accept 
working out of equilibrium, which, in some fields, is a tough pill to swallow. 

In the context of magnetic systems — where, unlike with glasses, one is sure of 
being below a phase transition — the off-equilibrium regime is a completely natural 
experimental environment and has been for some time. A classic application is the 
study of coarsening, a kind of dynamics characterised by the growth of compact 
coherent domains. In this case, an especially powerful version of universality op- 
erates, aptly called superuniversality | |fis86b ||. According to it, all the spatial and 
temporal scales during the dynamics are encoded in the growth of a coherence 
length, which indicates the size of the coherent domains (cf. Chapter [9]). 

In general, we can say that the most common feature of complex systems is 
an incredibly slow dynamical evolution, or aging | str78[ BOU98I. The study of 
non-equilibrium relaxation is, thus, very important and often the only accessible 
experimental regime. 

In order to explain this sluggishness, the most often invoked defining char- 
acteristic of complex systems is the picture of a 'rugged free-energy landscape' 
l iFRA97} [jANoS a| . The configuration space is pictured as having many valleys, defin- 
ing metastable states where a configuration is much more favourable than neigh- 
bouring ones (Figure i.2[ >. The system in its evolution, then, must jump from one 
local minimum to another through rare-event states, causing the slow dynamics. 

The causes of this ruggedness are diverse. For some materials, it may be due to 
the presence of impurities or other defects, which hinder the physical evolution. In 
others, the sluggish behaviour has been modelled as a hierarchically constrained 
dynamics, consisting in the sequential relaxation of different degrees of freedom, 
from the fastest to the slowest PPAL84I. 

Sometimes one of the valleys dominates and the free-energy profile is funnel 
shaped. This is the case, for instance, of protein folding, where the native con- 
figuration defines an absolute minimum. For other systems, on the other hand, 
there can be many equally favourable configurations, so one must take several 
metastable states into account even when defining the equilibrium. The difference 
between both cases is not idle: proteins quite obviously are able to fold into their 
equilibrium configuration very quickly (in human terms), while glassy systems 
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Figure 1.3: A small portion of a spin-glass lattice. The spins on the nodes can only take the 
'up' or 'down' orientation. They are joined along nearest-neighbour links by a mixture of 
ferromagnetic (+) and antiferromagnetic (— ) couplings. The former favour aligned pairs 
of spins and the latter favour antiparallel ones. With the arrangement shown in the figure 
there is no way of orienting the spins so that all of the bonds are satisfied. 



with metastable behaviour are perennially out of equilibrium. 

Since in this latter case the equilibrium phase is experimentally unreachable, 
determining what (if any) bearing it has on the dynamical evolution (what we shall 
call the 'statics-dynamics relation') is a non-trivial problem. 

This discussion notwithstanding, we must caution the reader that Figure 



1.2 



IS, 

at best, a metaphor. In order to turn it into real physics one must first, at the very 
least, identify one (or more) appropriate reaction coordinates capable of actually 
labelling the different minima. This requires a great deal of insight into the system's 
physics and still leaves unresolved the non-trivial step of actually computing the 
free energy. 

The quantitative investigation of these two issues (statics-dynamics and the free- 
energy landscape) will constitute the main themes of this dissertation. We shall 
work in the context of disordered magnetic systems, long considered prime exam- 
ples of complexity]^ One may think that the introduction of disorder cannot be 
responsible for very exciting changes in a physical system. This is true in some 
cases (after all, even the most perfect experimental sample has some impurities, yet 
we can still talk of crystals or ferromagnets), but not in general. For some strongly 
disordered systems, we shall see, the impurities have a dramatic effect both in a 
technical sense (being a relevant perturbation in a renormalisation-group setting) 
and in a very physical sense. Consider, for instance, the example of Anderson 
localisation |and38[ , capable of turning metallic systems into insulators. 

One of the simplest mechanisms responsible for the complexity of disordered 
systems is that of frustration jTOU77| , as very clearly illustrated in the case of spin 
glasses (see, e.g., [Ibin861 mezSj | and cf. Chapter [9]). These are magnetic alloys in 



^We shall give a detailed introduction to these systems in their respective chapters, for now we 
limit the discussion to a few general comments. 
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which the interactions between the spins are in conflict. The disorder manifests 
as a mixture of ferromagnetic and antiferromagnetic couplings. Thus, even for the 
lowest-energy configuration some of the bonds are necessarily frustrated, as we 



see in Figure 1.3 This makes it exceedingly difficult for the physical system to 
find the equilibrium state (and no less for the physicist performing a computation). 
Furthermore, many configurations have a similar degree of frustration and are 
therefore equally favourable, giving rise to a free-energy landscape with many 
relevant metastable states. 

Yet not only the equilibrium is complicated but also the dynamical behaviour. 
In the case of spin glasses, to the aging behaviour we have to add phenomena such 
as rejuvenation or memory effects [J ON98J . 

The choice of disordered magnetic systems as paradigmatic models for com- 
plexity is mainly due to their permitting more precise experimental studies than 
other classes of complex systems | iMYD93[ bel98| |. There are both technical and 
physical reasons for this, as we shall see later. 

On the theoretical front, on the other hand, these systems are at least easy to 
model with deceptively simple lattice systems. The solution of these models is an- 
other matter entirely. Indeed, disordered systems have often defied traditional, and 
powerful, analytical tools such a perturbation theory [|domo6I| . In the case of spin 
glasses, even the solution of such a gross simplification as the mean-field approx- 
imation has been a veritable tour deforce [TAK'ygA\ Ipar8o[ . Other systems, such as 
the random field Ising model, are seemingly more amenable to a perturbative treat- 
ment IIDOM06I |nat98[ but remain very poorly understood, the analytical approach 
failing to analyse the critical behaviour convincingly. 

In the last decades, a third avenue has opened for basic research: the com- 
putational approach, of which the most salient example is Monte Carlo simula- 
tion jLAN03} |RUB07 |. Unfortunately, in the case of disordered systems traditional 
Monte Carlo methods suffer from the same problems as experiments do, to an even 
greater degree. A simulation of a rugged free-energy landscape for a finite lattice 
will get trapped in the local minima, just as an experiment, with an escape time 
that grows as the exponential of the free-energy barrier, which in turn goes as a 
power of the lattice size. As a consequence, for many models of interest only very 
small systems can be thermalised, making extrapolation to the thermodynamical 
limit very complicated. Many simulation algorithms have been proposed to ad- 
dress this problem. However, as a general rule efficient innovations require some 
previous knowledge, or at least a somewhat detailed working hypothesis, of the 
underlying physics. Therefore, the investigation of new Monte Carlo dynamics 
must not be considered in isolation, but as an enterprise that should be undertaken 
jointly with a thorough study of challenging physical problems. 

An alternative is simply to eschew equilibrium and attempt a Monte Carlo re- 
production of an experimental dynamics. This is simple enough in principle, the 
most straightforward Monte Carlo dynamics being good mock-ups of the physi- 
cal evolution, and has the advantage of considering the system in more controlled 
conditions that are possible in a laboratory. Unfortunately, current computers are 
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several orders of magnitude too slow to reach the experimentally relevant time 
scales. 

In short, the study of complex systems faces significant obstacles, both experi- 
mental and theoretical. From a fundamental point of view, far from discouraging 
further effort, these problems are at heart the reasons why these systems are so 
interesting, constituting a constant reminder that the traditional tools of statistical 
mechanics must be continuously complemented and expanded. This in addition 
to the fact that by 'complex systems' we encompass such everyday materials as 
glasses, as well as systems with great technological or medical relevance (colossal 
magnetoresistance oxides, proteins, etc.). 



1.1 

Scope of this thesis 

This thesis is an attempt to provide a new outlook on complex systems, as well as 
some physical answers for certain models, taking a computational approach. We 
have focused on disordered systems, addressing two traditional 'hard' paradig- 
matic models in three spatial dimensions: the Edwards-Anderson spin glass and 
the diluted antiferromagnet in a field (the physical realisation of the random-field 
Ising model). These systems have been studied by means of large-scale Monte 
Carlo simulations, exploiting a variety of platforms (computing clusters, supercom- 
puting facilities, grid computing resources and even a custom-built special-purpose 
supercomputer). In accordance to the above discussion, the physical study has been 
taken hand in hand with the development of new Monte Carlo methods. 

Indeed, at the foundation of the work reported herein is the development of 
Tethered Monte Carlo, a general strategy for the study of rugged free-energy land- 
scapes. This formalism provides a general method to guide the exploration of con- 
figuration space by constraining (tethering) one or more parameters. In particular, 
one selects a reaction coordinate (typically, but not necessarily, order parameters) 
capable of labelling the different local free-energy minima. A statistical ensemble is 
then constructed, permitting efficient Monte Carlo simulation where these coordi- 
nates are fixed, avoiding the need to tunnel between competing metastable states. 
From these tethered simulations the Helmholtz potential associated to the reaction 
coordinates is reconstructed, yielding all the information about the system. 

This philosophy is applied first to ferromagnetic models (hardly complex sys- 
tems, but extraordinary benchmarks nonetheless) and then to the diluted antifer- 
romagnet in a field. There it is showed that the tethered approach, far from being 
a mere optimised Monte Carlo algorithm, is capable of providing valuable infor- 
mation that would be hidden from a traditional study, thus permitting a more 
complete picture of the physics. One of the more conspicuous examples of this is 
that pictures such as Figure |i.2[ long treated as mere conceptual aides, have been 
turned into real computations of free-energy profiles. 
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The next part of this dissertation is concerned with the Edwards- Anderson spin 
glass. For this system, our physical understanding is not yet at a level that would 
permit a full tethered treatment. This notwithstanding, many pointers are taken 
from the tethered philosophy, particularly in regards to the analysis of physical 
results. For the Monte Carlo simulation, the strategy has been mainly one of sheer 
brute force. Our work on spin glasses has been conducted within the Janus Collab- 
oration, a joint effort of researchers from five universities in Spain and Italy. This 
project has as its main goal the construction and exploitation of Janus, a special- 
purpose computer optimised for spin-glass simulations, where it outperforms con- 
ventional computers by several orders of magnitude. 

Aside from having been carried out with slightly different methods, our work 
on spin glasses is complementary to the rest of this thesis in a major physical way: 
it makes a strong emphasis on off -equilibrium dynamics. Indeed, as we advanced 
in the previous discussion, experiments on spin glasses (and many other complex 
systems) are always performed in an off-equilibrium regime. Then, a very valid 
question arises: how relevant is it to know the unreachable equilibrium phase? 
Our outlook, as we said before, has been that equilibrium structures, though in- 
accessible, do condition the off-equilibrium evolution. This idea, long accepted as 
a working hypothesis, is turned into a quantitative statement with the finite-time 
scaling paradigm. Time is treated as a state variable, much as pressure or tem- 
perature in a traditional thermodynamical setting. A time-length dictionary links 
off-equilibrium results in the thermodynamical limit for finite times with equilib- 
rium results for finite lattices. 

The next section summarises the organisation of the rest of this dissertation. It 
should be noted that each section contains a more detailed topical introduction, 
expanding on the issues touched in this General Introduction. 

1.2 

Organisation of this dissertation 

As discussed above, the work reported herein is concerned both with the study 
of paradigmatic complex systems in statistical mechanics (the DAFF and the spin 
glass) and with the development of new Monte Carlo and analysis methods. The 
rest of this dissertation is, therefore, organised thematically in the following way: 

• Part|l| including this General Introduction, has the purpose of motivating our 
study and presenting our outlook on complex systems and how they should 
be treated. We start by, very briefly, recalling some statistical mechanical con- 
cepts relevant to the study of complex systems (Chapter [2]). This is followed 
by Chapter already concerned with our Monte Carlo approach. In it we ex- 
pand on the practical problems posed by the numerical simulation of rugged 
free-energy landscapes and motivate the tethered formalism as a way of re- 
moving or alleviating them. This last chapter contains some material from 
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our paper [|MMii j | . 

Part [n] deals with our work on new Monte Carlo methods, motivated by the 
above considerations. We start by detailing the construction of the tethered 
formalism in Chapter |4j We then present a first demonstration of the method 
in a straightforward application: the D = 2 Ising model (Chapter [5]). This 
is, of course, a well understood system, so our aim is not so much present- 
ing new physics as demonstrating our methods and how a tethered study 
can provide a complementary picture to canonical methods. Finally, Chap- 
ter |6] proves that the tethered formalism is compatible with advanced Monte 
Carlo algorithms, in this case cluster methods. We first introduced the Teth- 
ered Monte Carlo method in [ jFERogcf and we later presented it in a more 
general context in Immii] . Chapters [5] and |6] contain material from jFER09ci l 
and jMMog] , respectively. 

Part |III] discusses the first class of complex systems we shall study: the diluted 
antiferromagnet in a field. In Chapter [t] we present the situation that existed 
prior to our work and demonstrate how this system is particularly ill-suited 
to a study with canonical methods. We then tackle the problem with the 
tethered formalism in Chapter |8| applying all the techniques introduced in 
Part |IIj This chapter is a much expanded version of [Iferiib] , including some 
material from [ jMMiii . as well as some unpublished results. 



Part IV deals with spin glasses. In Chapter [9] we introduce these systems. 



stressing their peculiarities from an experimental point of view (with phe- 
nomena such as aging, rejuvenation, memory, etc.) and their resulting the- 



oretical importance as paradigmatic complex systems. Chapter 10 deals in 
depth with one of the main themes of this thesis: the relationship between 
equilibrium and non-equilibrium and introduces the finite-time scaling frame- 
work. Finally, Chapter [11] studies in detail the structure of the spin-glass 
phase in three spatial dimensions. Our work on spin glasses, carried out 



within the Janus collaboration, was published in |jANo8ct |jano9A| , |janToaJ 



Ijaniob| . Chapters 10 and 11 are a heavily reworked and reorganised version 



of the results in those papers, including some unpublished material. 
Finally, Chapter [12] contains our conclusions. 

We include several appendices. Appendix A gives some notes on how to 
assess thermalisation in Monte Carlo simulations. It starts reporting some 
standard definitions but then describes some new techniques for complex 
systems (first introduced in jjANiOAj ). Appendix |B] presents some techniques 
for analysing the strongly correlated data produced in the Monte Carlo simu- 
lation of disordered systems. The explained methods are illustrated with es- 
pecially tough examples and case studies taken from our work. Appendix [C| 
contains some practical notes for an efficient numerical implementation of 
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Tethered Monte Carlo. Finally, Appendix D introduces the Janus special- 
purpose computer used in our spin-glass simulations and Appendix |E] gath- 
ers all the technical information on these runs (parameters, thermalisation, 
etc.). 



CHAPTER II 



Statistical mechanics of disordered systems: 
basic definitions 



In this chapter we briefly recall some general deflnitions that will we employed 
throughout this dissertation, with the main purpose of flxing the notation and 
introducing some notions particular to the study of disordered systems. For gen- 
eral references on statistical mechanics or the theory of critical phenomena see, 
e.g., IIlanSoI [hua87[ [AM103} |lb91[ \z]o^\ |car96[ . For disordered systems, see |mez87[ 
YougSf Idomo61 [pUToTil . 



2.1 

Statistical mechanics and critical phenomena 

We consider a system whose configuration can be specified by N degrees of free- 
dom {sx}- In the canonical ensemble, the behaviour of the system is encoded in 
the partition function 

Z = E e-^^««-». (2.1) 

In this equation, the sum is extended to all possible configurations, their relative 
weights depending on their energy E. The parameter (i = 1 /k^T is the inverse 
temperature. We use units where the Boltzmann constant is = 1/ so j6 = 1 / T. 

Throughout this dissertation we study Ising spins, = ±1, on a square lattice 
of N = nodes, where D is the spatial dimension. Therefore, in the following we 
often use the language and notation of magnetic systems, even if much of what we 
say can be applied to more general systems. 

We define an observable 0{{sx}) as any function of the spin configuration. In the 
context of Monte Carlo simulations, where we estimate Z by means of a random 
walk in configuration space, we use the word measurement for a single evaluation 
of the observable during the simulation. 
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The total energy of the system can often be written in the following way 

E{{s,}) = U{{s,})-hX{{s,}), (2.2) 

where 17 is the interaction energy of the spins and h is an external field, coupled 
to some reaction coordinate X. In our case, U will be of the form of a two-spin 
interaction 

^{{Sx}) = - E/^2/^^V (2.3) 
x,y 

where the Jxy are the couplings. For instance, for the ferromagnetic Istng model, 
Jxy = 1 if X and y are nearest neighbours on the lattice and zero otherwise. We 
represent this sort of nearest-neighbours interaction as 

In this context, the most straightforward reaction coordinate is the magnetisa- 
tion M, 

M{{s,})=J^s,, (2.5) 

X 

although we shall consider other kinds. 

Therefore, the canonical ensemble describes the system at fixed temperature 
and applied field. The basic thermodynamic quantity is the free-energy densit}|^ 

fN(/3,^) = ^logZ(/3,h). (2.6) 
The average value of an observable in the canonical ensemble is denoted by 

(0) = 7 E 0{{sx}) e-^^«^^». (2.7) 

For quantities such as M or E we distinguish the extensive version from the 
density by the use of uppercase and lowercase symbols, respectively, 

M = Nm, E = Ne. (2.8) 

Notice that we do not use this convention for the free energy, F]v, which is also a 
density, but only for observables. 



^This is often defined with a different normalisation, J-jv = — ^^logZ, but we shall find our 
definition more convenient later on. 



2.1 — Statistical mechanics and critical phenomena 



35 



2.I.I Legendre transformation 

Throughout Part we shall find it interesting to consider an alternative ensemble 
where it is the reaction coordinate m, and not the field h, which is kept fixed. In 
classical thermodynamics, this is accomplished through the Legendre transforma- 
tion, defining a new basic potential. 

nN(iS,m) = ^mh - F^{^,h). (2.9) 

We say Djv is the Helmholtz free energy (f]v is the Gibbs free energy). Since these 
names are often exchanged in the literature, we avoid confusion by reserving the 
name 'free energy' for F^, while we shall call Djv 'effective potential', borrowing 
the name from the context of quantum field theory. If Fjv is a convex function of 
h, this operation allows us to define Cl^ as a convex function of m. However, for 
disordered systems with rugged free-energy landscapes Cl^^ is never convex for 
finite N. Therefore, we shall consider the following alternative representation of 
the transformation, 

Zn(/3, h) = e^fN(M) = I dm e''^f^hm-n^{f^,m)] ^ ^^^^^ 
The conjugate nature of m and h can be summarised by the following formulae 

, , lafN 



|6 dh 



1 anjv 



jS dm 



(2.11) 



where by (• • •)^ we denote the expectation value in the fixed-m ensemble defined 
by D]v. 



2.1.2 Critical phenomena and exponents 

We shall often consider the behaviour of physical systems in the neighbourhood of 
a second-order phase transition, where the system approaches continuously a state 
at which the scale of correlations becomes unbounded. In particular, if we write 
the correlation between sites x and y as 

, , |x — 1/1^-00 I , T-N 

{SxSy) > exp{-\x-y\/^), (2.12) 

then, in the thermodynamic limit, the correlation length ^ diverges as we approach 
the critical temperature Tc. We characterise the divergence by a critical exponent v 

^r^\T-T,\-\ (2.13) 

This behaviour is not exclusive of the correlation length, many other quantities 
either diverge or vanish as we approach T^. Therefore, one defines additional 
critical exponents: 
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Response of the system to an infinitesimal field h (for instance, the magnetic 
susceptibility), 

X^\T-Tc\~^. (2.14) 

Specific heat 

Cr^\T-T,\-r (2.15) 

Order parameter (for instance, the magnetisation) 

m^ {Tc- T)^. (2.16) 



Unfortunately, this exponent uses the same symbol as the inverse temperature 
in what is a completely universal usage, but the context should always make 
clear which is the referred quantity. 

• Precisely at T = Tc, the decay of the correlation function is characterised by 
the anomalous dimension 

(s,S,)^|x-t/|-(°-2+'?). (2.17) 

• Finally, again at T = Tc, the order parameter has a critical dependence on the 
applied field h 

m - h^^\ (2.18) 

We have thus defined six different critical exponents. However, very general con- 
siderations let us establish the so-called scaling relations 

2|6 + 7 = 2 - a, (2.19a) 

2^5 - ^ = 2- Di, (2.19b) 

7 = v(2-7), (2.19c) 

vD = 2 — a. (2.i9d) 

The last of these, involving the dimension D, is called a hyperscaling relation. 



Using (2.19), we see that for a general system, only two critical exponents are 
independent. 

In mean-field theory the exponents are j6 = v = l/2, 7 = 1, a: = ;/ = 0, 
^ = 3. Above some (model-dependent) upper critical dimension Du all systems are 
described by these mean-field exponents. Notice that they do not depend on D, in 
contrast with the hyperscaling law. Usually, Du = 4, but we shall see that this is 
not the case for disordered systems. Finally, below the lower critical dimension Dj 
there is no transition. 
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2.1.3 Finite-size scaling 

One of the most useful tools for the study of critical phenomena is the scaling 
hypothesis. According to this, in the thermodynamic limit the correlation length 
^00 is the only characteristic length of the system in the neighbourhood of Tq (in 
this regime, the correlation length is large in units of the lattice spacing and the 
system 'forgets' about the lattice). 

In a finite lattice, the corresponding finite-size scaling (FSS) ansatz states that 
the finite-size behaviour is determined by the ratio L/^oo- If the ratio is large, the 
finite-size effects are not important and the finite system is not essentially different 
from the thermodynamic limit. If the ratio is small, however, we say we are in the 
FSS regime. There, an observable O will behave as 

(0)('):^L-°/7o(i^/^oo), (2.20) 
where Xq characterises the critical behaviour of O, 

(0)(~) - iT-TcT^o. (2.21) 
Alternatively, using the definition of the critical exponent v we can write 

(0)(^) ~ L-o/y~f^^iyyt), t = {T,- T)/T,. (2.22) 

The FSS ansatz can be derived using renormalisation-group techniques (see, for 
instance, jAMi03] ). 

Note that in a finite lattice one cannot really talk of a phase transition. There are 
no actual divergences of the physical quantities, only ever narrowing peaks whose 
position te nds t o the real critical point and whose height grows as L^o/^. in this 



sense, Eq. (2.22 1 encodes the behaviour in a crossover region of width ^ L~^^^ 
between the two phases. In the thermodynamic al limit, this interval degenerates in 
a point and the crossover turns into a proper phase transition. 

Finally, let us note that at the critical point ^00 — > 00, so the scaling hypothesis 
leads to the conclusion that the system exhibits scale invariance, since there is no 
characteristic length. This is an important observation for detecting the presence of 
a second-order transition. 



2.2 

Quenched disorder 

Let us now consider a system with disorder. In addition to the spins Sx, we need 
to specify additional variables jii that characterise the randomness. In our model 



Hamiltonian (2.2 1, these can be random couplings Jxy, vacancies in the lattice or 



even a random, site-dependent field h;^ 
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In principle, for each configuration of the disorder variables (disorder realisa- 
tion or sample) we will have a different partition function 

Z,, = E e-^^"»^--». (2.23) 
{%} 

If the disorder variables exhibit a dynamical evolution in time scales short com- 
pared to the observation time (diffusion of impurities at high temperatures, for 
instance) we say the disorder is annealed. In this situation, we can treat the jdj as 
additional dynamical variables and average over them, to obtain the complete par- 
tition function. We denote this disorder average by an overline, (•••), so the free 
energy of the system is 



1 

-N=^logZ,,. (2.24) 



We are interested in the opposite limit, where the impurities show no dynamical 
evolution in experimental time scales. We say the disorder is quenched, so the free 
energy is different for each sample 



1 

fN(?^) = ^logZ^. (2.25) 



This does not seem like a useful concept, because it seems to imply we would need 
a different model for each particular piece of material. What actually happens is 
that, for large enough systems, the physical properties do not depend on the 
anymore, 

lim fjv(f^) = foo. (2.26) 

There is a simple argument for this in finite dimension, due to Brout jBR039] . We 
divide the lattice in many macroscopic systems of size 1 < K° < N. Then the 
free-energy density of the whole system will be the average of those of the N /R^ 
subsystems, plus a contribution coming from interactions between them. If we as- 
sume that the interactions are short-range, this latter contribution is an interface 
energy, negligible in the large-N limit. Therefore, computing the free-energy den- 
sity of a very large lattice is essentially the same as averaging that of many smaller 
systems and the central limit theorem guarantees that 

r2 1 



- Fn{^) - ^ • (2.27) 

We say the free energy self-averages. 

So, the concept of quenched disorder is physically sound, but it implies a seri- 
ous difficulty. In order to obtain physically meaningful results, we have to average 
the free energy, which is the same as averaging the logarithm of Z^ 

1 

N' 



-N = Fn{}i) = —log Z^ . (2.28) 
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The task of computing the average of a logarithm, unusual in statistical mechanics, 
is exceedingly difficult. There is, however, a way around it: the so-called replica 
method | |kac681[edw72] . This is based on the elementary relationship 

Z" - 1 

loeZ = lim . (2.20) 

" n^O n 

For positive integer n, can be expressed in terms of identical replicas of the 
system (sharing the same configuration of the fij) 



Zj = flzW= Eexp[-/3f:H(^)({s«}) 



(2.30) 



=1 {sj} L a=l 

This quantity is easier to average over disorder. The objective then, is to obtain a 
replica partition function 

Zn = Z|, (2.31) 

that no longer depends on disorder and afterwards take the limit n — )■ in (2^29]). 
This procedure can be mathematically delicate in some cases (see, e.g., | mez87[ ). 

Let us consider now the disorder average of an observable 0{{sx})- In principle 
we have to do 

(Oy = ^ E e-^^"«^^»0({s,}), (2.32) 

which has the unpleasant feature of having disorder variables both in the numera- 
tor and denominator. This can be solved multiplying both by Z"~^, 



(0) = %^L e-^^"«^^>)0({s,}). (2.33) 

Now we write the numerator in the replica notation, assigning the original partition 
function to replica 1 (this choice is, of course, arbitrary) 



In the n — 7- limit the denominator goes to one and we have 

(O) = lim E e-^^«<*««^»0({si}). (2.35) 

In a successful application of the replica trick, one hopes to integrate the depen- 
dence on the fij explicitly and define an effective Hamiltonian Tin that no longer 
depends on the disorder (only on n). Then 

(O) = lim (0)„ = lim E e-^^«({^J})0({si}). (2.36) 

In this dissertation we do not carry out any such replica computations, but in 
Chapter [9] we give an outline of a particularly famous example: the mean-field 
theory of spin glasses. 
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2.2.1 The relevancy of disorder 

Real systems always have some measure of disorder, either in the form of impuri- 
ties or vacancies in the lattice. However, this disorder is not always relevant in the 
sense of changing the universality class of the system. 

For simplicity let us consider a ferromagnetic system, where we introduce some 
disorder in the couplings. We consider a Hamiltonian of the form 

'H = -J^JxySxSy, (2.37) 

x,y 

and let us write the couplings Jxy as a translationally invariant part plus a pertur- 
bation 

Jxy = }{\x-y\)+S}xy. (2.38) 

We say that a system described exclusively by the translationally invariant part is 
the 'pure' system corresponding to our disordered model. 

Then, there are two interesting limiting cases. In the first, the disorder is strong, 

so 

^Jxy::^J{\x-y\). (2.39) 

Therefore, the disorder completely dominates the low-temperature properties of 
the system. In particular, the low-temperature ferromagnetic order is destroyed 

and the system is described by a 'spin glass' phase where (sx) = but (sx) 7^ 0. 



The Edwards- Anderson model, which we study in Part IV is one example. 
On the other hand, the disorder may be weak 

SJxy <^ }{\x-y\). (2.40) 

In this case, one would not expect the disorder to effect great changes in the 
ground-state properties. The low-temperature phase would continue to have a 
ferromagnetic order, for instance. However, in the case of a second-order phase 
transition for the pure model, the critical exponents may change. Furthermore, if 
the transition of the pure system is of first order, it may become continuous. 

There is a useful criterion for determining whether the weak disorder is going 
to be relevant, due to Harris IHAR74] . According to it, if the specific-heat exponent 
of the pure system is a.^^^ > 0, then the disorder will change the critical behaviour. 
On the other hand, if the specific heat of the pure system is finite, the disorder will 
be irrelevant (it will not change the critical exponents). 



Finally, let us note that in Part III we study the random field Ising model, where 



the disorder is not of the kind described by (2.38 1, but takes the form of random 
fields. In that case the disorder is also very severe, because the randomness couples 
to the local order parameter. 

2.2.2 Self -averaging violations 

We started our discussion of quenched disorder by giving a general argument in 
favour of the self-averaging property. This argument, however, breaks down at the 
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critical point, where the correlation length diverges and our division of the lattice 
in smaller subsystems with negligible interaction no longer works. Therefore, the 
issue of self-averaging becomes non-trivial at the transition point. 

In fact, it has long been known that for spin glasses there is no self-averaging in 
the ordered phase |Ibin86[ . For systems with weak disorder, a framework analogous 
to the Harris criterion can be established. 

Let us consider some macroscopic quantity O (the magnetisation, energy, etc.) 
and let us consider the probability distribution of the (O) for different samples, 
which we characterise by its relative variance. 



(O) 

We say the system is self-averaging if 



Ro y 0. (2.42) 



Aharony and Harris |aha96[ reached the following conclusions: 



1. Away from the critical region, we can apply the Brout argument and 

Ro-^i^/Lf. (2.43) 
We say that the system is strongly self-averaging. 

2. At the critical point we have to distinguish two possibilities. 

(a) The disorder is irrelevant (in the sense of the Harris criterion). Then for 
the pure system a^^^ < and 

The system is weakly self-averaging. 

(b) The disorder is relevant. In this case the system is no longer self -aver aging, 

lim Ro ^ 0. (2.45) 

Soon after the pioneering renormalisation-group work of Aharony and Harris, sev- 
eral authors studied the issue of (lack of) self-averaging in disordered systems with 
numerical simulations jwiS93} IPAZ97} |wiS98[ |bal98[ |BER04y 



This break down of the self-averaging property is an additional difficulty for 
the study of the critical behaviour of disordered systems. In Chapter |8| however, 
we shall demonstrate an approach that minimises its effects, in the context of the 
random field Ising model (a system where the violation of self-averaging is partic- 
ularly severe liPAR02i |wuo6l Imal"o61 ifytii]] ). 



CHAPTER III 



Managing rugged free-energy landscapes: 
a Tethered Monte Carlo primer 



Monte Carlo (MC) simulation (see, e.g., jLAN03f |rubo7} |sok97} |new96[ for general 
reference works) constitutes one of the most important modern tools of theoretical 
physics. At a first glance, it seems a very inefficient method: its statistical char- 
acter meaning that the uncertainty in the result decreases only as 1 / \/N , where 
A/" is a measure of the numerical effort. From a closer inspection, however, comes 
the realisation that deterministic numerical methods, typically thought to converge 
with higher powers of M or even exponentially, quickly lose their efficiency when 
the number of degrees of freedom is increased (think, for instance, of the compu- 
tation of multi-dimensional integrals). In contrast, the 1/ ^/M behaviour of the MC 
method, a consequence of the central limit theorem, is stable. 

In the context of statistical mechanics, we are interested in extracting system 
configurations that follow some complicated probability distribution, with a huge 
number of degrees of freedom — typically p({sx}) , for systems in the 

canonical ensemble (2.1 1. This is accomplished by means of a dynamic Monte 
Carlo, where the generation of a new configuration depends on the current one. 
In technical terms, we set up an ergodic Markov chain whose stationary distribu- 
tion is the physical distribution describing the equilibrium state of the system (cf. 
Section A.i| . Once the stationary regime is reached, we estimate (O) as an un- 
weighted average ofO(i) = 0{{sx{t)}). The error then goes as ^ 1/ A/A/gteps/ but 
with a potentially large pref actor (see Appendix [A]>. 

At a first glance one may think this probabilistic method is a poor alternative 
to traditional tools such as perturbation theory. Yet, among the most interesting 
problems in statistical mechanics and quantum field theory we often find strongly 
coupled systems, far from the perturbative regime. In these situations, most of our 
analytical tools break down and MC simulation emerges as one of a handful of 
workable methods. 

This is not to say that a MC computation does not have its difficulties. Chief 
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among these is the issue of thermalisation: before we can even start to worry about 
the 1 / y/j\f behaviour of our numerical precision, the Markov chain has to reach 
its stationary distribution. For the most physically interesting regime, the neigh- 
bourhood of a phase transition, this turns out to be difficult, because of critical 
slowing down jHOH77[ , |zj03] . This phenomenon consists in the rapid growth of the 
characteristic times with the system size. Even for very simple systems, such as the 
Ising model, the thermalisation times of traditional MC methods grow as L^, with 
z ~ 2 and L the linear size of the system. Only for scant few systems can one find 
optimised dynamics with z < 1 (cf. our study of cluster methods in Chapter |6]). 

In many other situations the critical slowing down is even worse than the z ~ 2 
behaviour. This is the case of the rugged free-energy landscapes considered in 
the General Introduction, where the thermalisation times grow exponentially with 
the free-energy barriers. These barriers not only constitute a formidable stumbling 
block for traditional MC methods, but are also physically interesting in their own 
right. This is because the actual physical evolution of the system is hindered by 
these same dynamical bottlenecks. 

In the following section we give some precise examples of this phenomenon 
and briefly review some of the methods that have been devised to address it. In 
Section \j.2\ we introduce our proposal: Tethered Monte Carlo, a formalism which 
we shall develop and employ throughout Parts III] and III of this thesis. 



3-1 

Free-energy barriers and Monte Carlo simulations 

The most straightforward example of a free-energy barrier is encountered when- 
ever we want to consider first-order phase transitions tGUN83| BIN87I. In these 



situations two phases (ordered and disordered, or with different kinds of order) 
coexist at the critical point. In a traditional MC simulation the system must tunnel 
back and forth between these two pure phases by forming a mixed configuration, 
featuring an interface of size L. This intermediate state has a free-energy cost of 
T,LP~^, where Z is the surface tension and D the spatial dimension of the system. 
Therefore, the probability of crossing the gap between the ordered and disordered 
states scales as exp[— ZL^"^]. Equivalently, the simulation suffers an exponential 
critical slowing down, where the characteristic times grow as exp[ZL^~^]. 

The situation can be even worse, as demonstrated by crystallisation studies. 
Here, even for the simplest models there are many local free-energy minima. These 
correspond to crystals with different symmetries and varying numbers of defects, 
or even to amorphous solids (glasses). See, e.g., [pusSgJ for an experimental exam- 
ple. 

Furthermore, the issue of free-energy barriers is not limited to first-order tran- 
sitions. A prime example of this is the random field Ising model, which we shall 



study extensively in Part III Here there still exists a free-energy barrier between 
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the ordered and the disordered states, but only one of these configurations defines 
a stable phase. The difference with the first-order scenario is that the barriers grow 
as L^, where 6 < D — 1. Therefore, we still have a thermally activated critical slow- 
ing down, with logT ^ (we shall see that 6 = 1.469(20) for D = 3, so this is 
very severe). The difficulty is compounded by the fact that this is a disordered sys- 
tem, so one must consider many disorder realisations in order to get a meaningful 
picture (cf. the discussion in Chapter [2]). 

For the random field Ising model, we at least know the appropriate order pa- 
rameter that signals the phase transition, but this is not always the case. The most 
conspicuous example of this additional complication is the spin glass. The prob- 
lem is patched, but not completely solved, by using real replicas (clones of the 
system with the same disorder realisation, evolving independently under the same 
dynamics). In this case, the actual structure of the ordered phase is still in dispute 
but, at least for finite systems, there are a large number of local minima (see Chap- 
ter [9]). This is a very popular problem, which has prompted the introduction not 
only of ad-hoc MC methods, such as parallel tempering (see Appendix |A]), but even 
of special-purpose computers | |oGi83[ iCRUOii ibalooJ . The latest example of this is 
the Janus machine (see Appendix D ), which we have used for our own spin-glass 
simulations (Part IV of this dissertation). Still, the use of a custom computer may 
accelerate the simulation by a constant factor, but does not change the scaling of 
the thermalisation times which, even with parallel tempering, are believed to suffer 
an exponential critical slowing down below the critical temperature. 

Many optimised schemes and formalisms have been proposed to deal with 
these problems. The case of first-order phase transitions, with two clean and 
easy to differentiate phases, is perhaps the best understood. For instance, mul- 
ticanonical |jBER92| or Wang-Landau |wanoi1 methods consider a generalised sta- 
tistical ensemble. The dynamics consists in a random walk in energy space, cov- 
ering the range bounded by the two competing phases. This strategy is able 
to overcome the free-energy barriers for small systems, but this only delays to 
larger sizes the advent of exponential slowing down jNEU03] . This is mainly due 
to the emergence of geometrical transitions in the energy gap between the two 
phases [1BIS021 |bino3^ |maco4} Imaco61 Inusso6B . 

In a more general case, however, the local minima cannot be distinguished by 
their energies and we have to consider alternative reaction coordinates (typically, 
but not necessarily, order parameters). Examples abound, perhaps the best known 
being the studies of crystallisation in supercooled liquids jwoL93[ Ichoo 61, where 
the different phases can be labelled with a bond-orientational crystalline order pa- 
rameter | |ste83| . The random- walk strategy can be adapted to some of these cases, 
resulting in the so-called umbrella sampling jTOR77] . Unfortunately, for sufficiently 
complex systems considering a single reaction coordinate is not enough. Tuning 
the parameters of a Wang-Landau or umbrella sampling simulation is in these cases 
rather cumbersome. 

A different strategy was first introduced in a microcanonical setting in | mmo7] . 
In this method, one performs independent simulations with a fixed energy along 
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the whole gap, which are then combined with a fluctuation-dissipation formalism 
to yield the entropy of the system. Thus, the need for tunnelling across geometrical 
transitions is eliminated and very large system sizes can be considered. 

In jFERogci l we generalised this microcanonical method to consider any reaction 
coordinate instead of the energy density. In a similar manner, the role played by the 
entropy in the microcanonical setting is taken by the Helmholtz effective potential 
associated to the chosen reaction coordinate. Furthermore, the application of this 
Tethered Monte Carlo method is not necessarily more difficult with several reaction 
coordinates. 

In a tethered computation, one simulates a statistical ensemble where the re- 
action coordinate x is constrained (tethered) to a narrow range around a fixed 
parameter x. This is accomplished through the introduction of a bath of Gaussian 
demons, which absorb the changes in the reaction coordinate, so long as these are 
not too large, to keep x constant. From several such simulations for different values 
of X the Helmholtz potential is readily reconstructed, yielding all the information 
about the system. The tethered formalism is not intended as a mere thermalisation 
speed-up, but it also grants us access to precious information that would remain 
hidden from a traditional approach. 

We will explain the construction of the tethered formalism in Chapter |4| Before 
engaging in detailed derivations, however, it is useful to understand all the steps 
of a Tethered Monte Carlo simulation. The remainder of this chapter provides 
such an outline, actually constituting a self-contained guide to the set-up of a TMC 
computation. 

We note that the following was first published as Section 2 of Immii] , which we 
reproduce here with minor emendations. 



3-2 

Tethered Monte Carlo, in a nutshell 

In this section we give a brief overview of the Tethered Monte Carlo (TMC) method, 
including a complete recipe for its implementation in a typical problem. This is as 
simple as performing several independent ordinary MC simulations for different 
values of some relevant parameter and then averaging them with an integral over 
this parameter. We shall give the complete derivations and the detailed construc- 
tion of the tethered ensemble in Chapter [4} 

We are interested in the scenario of a system whose phase space includes several 
coexisting states, separated by free-energy barriers. The first step in a TMC study is 
identifying the reaction coordinate x that labels the different relevant phases. This 
can be (but is not limited to) an order parameter. In the remainder of this section 
we shall consider a ferromagnetic setting, so the reaction coordinate will be the 
magnetisation density m. 

The goal of a TMC computation is, then, constructing the Helmholtz potential 
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associated to m, njv(/3, m), which will give us all the information about the sys- 
tem. This involves working in a new statistical ensemble tailored to the problem 
at hand, generated from the usual canonical ensemble by Legendre transformation 
(cf. Chapter [2]): 

Since in a lattice system the magnetisation is discrete, we actually couple it to 
a Gaussian bath to generate a smooth parameter, called m. The effects of this bath 
are integrated out in the formalism. 

In order to implement this construction as a workable Monte Carlo method we 
need to address two different problems: 

• We need to know how to simulate at fixed ih. 

• We need to reconstruct Of^{^, m) from simulations at fixed m and, afterwards, 
to recover canonical expectation values from ( |2.io[ > to any desired accuracy. 

We explain separately how to solve each of the two problems, in the following two 
paragraphs. 



3.2.1 Metropolis simulations in the tethered ensemble 

Let us denote the reaction coordinate by m (for the sake of concreteness let us think 
on the magnetisation density for an Ising model). The dynamic degrees of freedom 
are {s^}. Therefore m is an observable (i.e. a function of the {s^}). We wish to 
simulate at fixed m (m is a parameter closely related to the average value of m). 
The canonical weight at inverse temperature fi and h = would be exp[—fiU] 



where U is the interaction energy. Instead, the tethered weight is (see Section 4.1 
for a derivation) 

WN{^,m;{sx})=e-^^+^^"\m-ni)^^-^^^^0{m-m). (3.1) 
The Heaviside step function 0(m — m) imposes the constraint that m > m{{sx})- 



The tethered simulations with weight (3.1 1 are exactly like a standard canonical 



Monte Carlo in every way (and the balance condition, etc.). For instance, in an 
Ising model setting, the common Metropolis algorithm jMET33] is 

1. Select a spin s^. 

2. The proposed change is flipping the spin, — ^ —Sx- 



^In an atomistic simulation, one would try to displace a particle, or maybe to change the volume 
of the simulation box. 



48 



Managing rugged free-energy landscapes: a Tethered Monte Carlo primer 



3. The change is accepted with probability]^ 

Visx -Sx) = min{l,a;new/a;oid}- (3-2) 

4. Select a new spin and repeat the process. We can either pick at random or 
run through the lattice sequentially. In the work reported in this dissertation 
we have always followed the second option, more numerically efficient. 

Once N spins have been updated (or we have run through the whole lattice, in the 
sequential case) we say we have completed one Monte Carlo Sweep (MCS). 

We remark that the above outlined algorithm produces a Markov chain entirely 
analogous to that of a standard, canonical Metropolis simulation. As such it has 
all the requisite properties of a Monte Carlo simulation (mainly reversibility and 
ergodicity). Tethered mean values can be computed as the time average along the 
simulation of the corresponding observables (such as internal energy, magnetisa- 
tion density, etc.). Statistical errors and autocorrelation times can be computed with 
standard techniques (see Appendices [A| and [B|). 

The actual magnetisation density is constrained (tethered) in this simulation, 
but it has some leeway (the Gaussian bath can absorb small variations in m). In 
fact, its fluctuations are crucial to compute an important dynamic function, whose 
introduction would seem completely unmotivated from a canonical point of view: 
the tethered field b 

^_ ldlogcoN{^,ih;{sx}) N-2 ^^^^ 



N dm 2N[m - m{{sx})] ' 

One of the main goals of a tethered simulation is the accurate computation of the 
expectation value (fc)m- 

The case where one wishes to consider two reaction coordinates mi and m2 is 
completely analogous: 

X (mi - mi)^^-^^^^0{nii - mi) (3.4) 
X (m2 - m2)(^"^)/^0(m2 - mz) , 



where 



£ 1 aioga;N(j6,mm2;{sx}) N-2 



N ami 2N[mi -mi({sx})] 

1 d\ogcoN{^,mm2; {sx}) ^ N-2 

N am2 ~ 2N[m2-m2({s;c})] 



^In general, in order to satisfy the balance condition (see Section A.i I we have to take into account 
both the weight of the current and proposed configurations and the probabilities of proposing this 
particular change and its reciprocal. However, the latter are trivial, because the change is always 

Sx -Sx- 
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For the Ising model, a Metropolis Tethered Monte Carlo simulation reconstructs 
the crucial tethered magnetic field h without critical slowing down (see Chapter [5] 
for a benchmarking study). This may be considered surprising for what is a local 
update algorithm, but notice that the constraint on in is imposed globally Non- 
magnetic observables, such as the energy, do not enjoy this non-local information 
and hence show a typical z ~ 2 critical slowing down (although the correlation 
times are low enough to permit equilibration for very large systems, see Chapter [5]). 

Let us stress that the above outlined update algorithm is by no means the only 
one possible. For instance, the Fortuin-Kasteleyn construction | |kas69[ IFOR72] can 
be performed just as easily in the tethered ensemble, so we can consider tethered 
simulations with cluster update methods |swe87[ IedwSSI [woLSgl l . We demonstrated 



this in I jMMog] (see also Chapter |6]), where the tethered version of the Swendsen- 
Wang algorithm was shown to have the same critical slowing down as the canonical 
one for the D = 3 Ising model (z ~ 0.47). This is an example that the use of the 
tethered formalism implies no constraints on the choice of Monte Carlo algorithm, 
nor does it hinder it in the case of an optimised method. 

3.2.2 Reconstructing the Helmholtz effective potential from sim- 
ulations at fixed m 



The steps in a TMC simulation are, then, (see also Figure 3^1] ) 



1. Identify the range of ih that covers the relevant region of phase space. Select 
Mm points rhi, evenly spaced along this region. 

2. For each perform a Monte Carlo simulation where the smooth reaction 
parameter m will be fixed at rh = rhi. 

3. We now have all the relevant physical observables as discretised functions of 
m. We denote these tethered averages at fixed fh by (O)rfi. 

4. The average values in the canonical ensemble, denoted by (O), can be recov- 
ered with a simple integration 

(O) = / dmp(m)(0)^. (3.7) 

In this equation the probability density p{rh) is 

Pin 

p(m) = e-^"~('^'/5), nN(m,/3) = O^im^in) + I dm' {h)^,. (3.8) 

J 777rnin 



The tethered field (fc)^ was defined in Eq. (3.3 1. The integration constant 



^Ni'i^mm) is chosen so that the probability is normalised. 
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L 




MCS 


-(") 


16 


91 


10^ 


1.03472(10) 


32 


91 


10^ 


1.007189(78) 


64 


109 


10^ 


0.996868(11) 


128 


50 


10*^ 


0.9929493(45) 



Table 3.1: Energy density of the D = 3 ferromagnetic Ising model computed with the 
Tethered Monte Carlo method, showing that the reconstruction of canonical averages can 
be performed with great accuracy. The second column shows the number of points in the 
m grid and the third the number of Monte Carlo sweeps taken on each (we use a cluster 
update scheme). Data from the simulations reported in Chapter [6| 



If we are interested in canonical averages in the presence of an external mag- 
netic field h, we do not have to run any new simulations. Indeed, we can 
reuse the {0)fh and only recompute (only the relative weight of the teth- 
ered averages changes). This is as simple as shifting the tethered magnetic 
field: {b)n^^ {b)n^-^h. 

In order to improve the precision and avoid systematic errors, we can run 
additional simulations in the region where p{m) is largest. 



The whole process is illustrated in Figure 3.1 where we compute the energy density 



at the critical temperature in an L = 64 lattice of the D = 3 Ising model. Notice that 
the tethered averages (w)^ vary in about 10% in our m range, but the computation 
of the effective potential is so precise that the averaged value for the energy, (u) = 
—0.996 868(11), has a relative error of only ^ 10~^. 

This is the general TMC algorithm for the computation of canonical averages 
from the Helmholtz potential. As we shall see in some of the applications, some- 
times the integration over all phase space in step 4 is not needed and one can 
use the ensemble equivalence property to recover the (O) from the (O)^, through 



saddle-point equations, remember Eq. (2.10 1. In other words, the tethered averages 



can be physically meaningful by themselves. For example, the crystallisation study 
of Iferiia] is built entirely over the effective potential, one never uses the p{m). 

As will be shown in Chapter Q the reconstruction of canonical averages from 
the combination of tethered averages does not involve any approximation. We can 
achieve any desired accuracy, provided we use a sufficiently dense grid in m (to 
control systematic errors) and simulate each point for a sufficiently long time (to 
reduce statistical ones). Table [3^ and Figure 3.1 show the kind of precisions that 



we can achieve. One could initially think that the computation of the exponential 
in p{rn) = exp[— Nil]v(m)] would produce unstable or imprecise results for large 
system sizes. Instead, the combination of self-averaging and no critical slowing 
down makes the numerical precision grow with N. 
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m 



Figure 3.1: Computation of the Helmholtz potential O^q and the canonical expectation 
values from tethered averages in a D = 3, L = 64 ferromagnetic Ising model at the critical 
temperature. Top; tethered magnetic field (we show only the positive tail, in order to 
see its structure better at this scale), with an inset zooming in on the region between its two 
zeros. The statistical errors cannot be seen at this scale (except for the leftmost points in 
the inset). The integral of this quantity is the Helmholtz potential Cl-t^{ni). Middle: p{m) = 
exp[—NO]\[{m)] in a linear (right axis) and in a logarithmic scale (left axis). Bottom: tethered 
expectation values of the energy density u. Their integral over the whole m range, weighted 
with p{ih), gives the canonical expectation value (m) = —0.996 868(11). (horizontal line). 
See Chapter [6| for further details on these simulations. 



Part II 

The Tethered Monte Carlo formalism, 
with a new look at f erromagnets 
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CHAPTER IV 



The tethered formalism 



This chapter presents the tethered formalism in detail, noting its relation to the 
canonical ensemble and introducing some of the techniques that we will use through- 
out this dissertation (such as saddle-point equations). We first presented the teth- 
ered statistical ensemble in |fero9c| , demonstrating its application to the Ising 
model (cf. Chapter [5]). The exposition in this chapter also uses the Ising ferromag- 
net as a model system, since that will be the first application considered in this 
thesis (it would be straightforward to reproduce the construction for a different 
model, as we shall see in Chapter |8]). However, the treatment of the tethered for- 
malism is otherwise more general than that of [pE Rogc] . For instance, we include 
details on how to consider several tethered variables (Section |4.i.i| >, a feature that 
we will need in our study of the DAFF (Chapter |8]). 



4.1 

The tethered ensemble 



As noted above, we consider the D-dimensional Ising model, characterised by the 
following partition function 



Z{fi,h) 



^ exp ^ ^ s^Sy + ^hJ^s, 



(4-1) 



(recall that the angle brackets indicate that the sum is restricted to first neighbours 
and that the spins are Sx = ±1). As we indicated in Chapter [ij we shall always 
consider square lattices of linear size L and periodic boundary conditions, so a 
system in D spatial dimensions will have N = nodes. The partition function 
includes an applied magnetic field h. In this chapter we will work at fixed fi. 
Hence, to lighten the expressions we shall drop the explicit j6 dependencies. This 
simplified notation is also employed throughout our applications. 
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The spin interaction energy and magnetisation of a given configuration are 
Ui{s^}) = Nu{{s^}) = - E s^Sy, M{{s^}) = Nm{{s:,}) = J^s^,, (4.2) 

Since this chapter is concerned with the construction of a new statistical ensemble, 
we have to be very precise with our notation. Therefore, we use sans-serif italics 
for random variables (i.e., functions of the spins) and serif italics for real numbers 
(e.g., expectation values or arguments of probability density functions). For future 
chapters, dedicated to physical results, we return to the usual convention and will 
no longer make this distinction explicit. 

For instance, we shall denote the expectation values in the canonical ensemble, 
for a given value of the applied magnetic field, by 

U{h) = Nu{h) = {U){h), M{h) = Nm{h) = {M){h). (4.3) 

As noted in Chapter [ij whenever a symbol has an uppercase and a lowercase 
version, they correspond to extensive and intensive quantities, respectively. These 
expectation values are weighted averages over the 2^ possible configurations {sx} 
of the system 

= zk E ^^^'^^^ exp[-fiU{{sx}) + fihM{{sx})]. (4.4) 

{Sx} 

Let us for the moment consider the case h = and use the shorthand 

{0) = {0){h = 0). (4.5) 

Since this is a ferromagnetic system, we may be interested in considering the av- 
erage value of conditioned to different magnetisation regions. The naive way of 
doing this would be 

{OS{Nm-M({s,}))) 

The canonical average could then be recovered by a weighted average of the {0\m), 

(0) =E(0|m) pi(m), (4.7) 

m 

where 

pi(m) = ^ E ^("^ - M{{sx})/N) exp[-fiU{{sx})] • (4.8) 

{Sx} 

In the thermodynamical limit, pi would be a smooth function, the logarithm of the 
effective potential associated to the reaction coordinate m (cf. Section 2.1.1 1. For 
a finite system, however, there are only N + 1 possible values of m, so pi{m) is a 
comb-like function. 
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We want to construct a statistical ensemble where a smooth effective potential 
can be defined in finite lattices. The first step is extending the configuration space 
with a bath /? of N demons and defining the smooth magnetisation M, 



M= M + R, 



(4-9) 



wher^ 



R = Evf/^- (4-10) 

i 

The demons are statistically independent from the spins and Gaussianly distributed 



Df] exp 



Notice that, due to the central limit theorem, the above probability distribution 
approaches a Gaussian of mean 1/2 and variance {2N)~^ in the large-N limit. 
Now our partition function is 



Z(0)= [Ufj Eexp[-^ty({s,})-J]7?/2 



(4.12) 



The convolution of pi{m) and p2{^) then gives the probability density function 
(pdf) for m, 

■p{m) = J dm J dr pi{m)p2{r)S{m — m — r). (4-13) 

So is essentially a smooth version ofpi(m — 1/2). Writing p{m) explicitly we 
have 



p{m] 



I Di^s(^rn-m{{s,})-J^rjf/{2N)^ (4.15) 



g-/3L'({s4)+M({s4)-Nm 



In the second step we have used the Dirac delta to simplify the integral over 
the demons, which has been reduced to the computation of the area of an N- 
dimensional sphere. We have, finally 



('27t)N/2 



r(N/2)Z(0) 



(4.16) 



where 



cvNimis,}) = e-/^^({«-^»+'^({«-^»-^[m - m({s;,})](^-2)/20(^ _ m{{s,})) . (4.17) 



^Actually, R can be defined in different ways, see Section 4.1.2 
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The Heaviside step function enforces the constraint m > m. We can now introduce 
the effective potential Ojsi 

^-NHM = p^^) = cj^ cv^im, {s,}). (4.18) 

{%} 

We want to construct the tethered statistical ensemble, where would be the 



basic physical quantity, instead of the free energy Fj^. Comparing (4.18) with (4.1) 
and ( |44[ >, we see that 0;^ is going to take the role of the tethered weight, just as 
exp[—fiU + fihM] in the canonical case. We therefore define the tethered expectation 
value as 

E{s,}^nK{Sx}) 

We can now rewrite the canonical average as 

(0) = j dm (0)^ p(m) = j dm (0)^ e-^"^^'^), (4.20) 

This has the structure of Eq. \\.6) , but now {0)fh and p(m) are smooth functions 
even for finite lattices. 

Suppose now that we want to reintroduce the applied field h. It is clear from 
that (0|m) [h) = (0|m). Then, computing {0){h) is just a matter of reweighting 
the different magnetisation sectors: 

{0){h)=J^{0\m)pi{m;h), (4.21) 

m 

where 

Pi(ni;h) = ^^J^S{m- m{{s,})) exp[-^U{{s,}) + ^hM{{s,})] . (4.22) 



1 

Analogously, in the tethered notation we would have 



{0){h) = /dm (0)^ e-^n,in^)+N^hn^ ^ ^^^^^ 

where the partition function can be written as 

Z{h) = e^^^^(^) = jdm e-w["N(m)-/3?im] _ (^^4) 

This expression illustrates the fact that the construction of the tethered ensemble 
from the canonical one is a Legendre transformation. We have replaced the free 
energy, a function of h, with the effective potential, a function of m. 
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In the canonical ensemble the derivative of Fj^ with respect to h defines the 
magnetisation, the basic observable. Analogously, in the tethered ensemble the fh- 
derivative of Oj^ is going to define the tethered magnetic field — recall Eq. ( |2.ii[ >. 
We differentiate in (|4.i8[) to obtain 



1/2-l/N 
m-m{{s^}) 



dm 



L{s,}C^N{m;{sx}) 



(4-25) 



Then, recalling the definition of tethered expectation values in (4.19 1 we define the 
tethered magnetic fielc^ 



k{sx}) = 1 



so that 



1/2-l/N 
m- m{{sx}) ' 



drn 



(4.26) 



(4-27) 



The tethered magnetic field is essentially a measure of the fluctuations in m, which 
illustrates the dual roles the magnetisation and magnetic field play in the canonical 
and tethered formalism. This is further demonstrated by the tethered fluctuation- 
dissipation formula 



d(0 



dm 



(4.28) 



This formula is easy to prove differentiating in (4.19). The values of m where 
{b)th = will define maxima and minima of the effective potential and, hence, of 
the probability p{m ). 

Definition (4.26), together with the condition that p(m) = e~N^N{m) ]-,g 
malised, allows us to reconstruct the Helmholtz potential from the tethered av- 
erages alone. Therefore, from the knowledge of the (0),^ we can obtain all the 
information about the system (at the working temperature (i), including the canon- 



ical averages for arbitrary applied fields. Notice, from (4.23), that considering a 



non-zero h is simply equivalent to shifting the tethered magnetic field, (6)^ — ^ 

The computation of the (0)^^, as detailed in the previous chapter, is just a matter 
of running several independent Monte Carlo simulations with the weight (4^17]) in 
a sufficiently dense grid of m. 

As a final comment, we note that our introduction of demons is reminiscent 
of Creutz's microcanonical algorithm ICRE83I, but there are several important dif- 
ferences: (i) we include one demon for each degree of freedom, (11) our demons 
are continuous variables and (iii) we explicitly integrate out the demons, finding a 
tractable effective Hamiltonian. Still, notice that one could also define a version of 



^In this thesis, we have changed the sign convention for flf^ with respect to that of [ jFERogcj . 
Therefore, our b would be the —hoi [FERogcj . 
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the tethered formalism where the demons are not integrated, but rather treated as 
actual dynamical variables. This has worse numerical performance (as we shall see, 
the non-local nature of the conservation law is crucial to break the critical slowing 
down), but could have advantages for non-standard computer architectures. 

4.1.1 Several tethered variables 

Throughout this section we have considered an ensemble with only one tethered 
quantity. However, as we shall see in Chapter |8| it is often appropriate to consider 
several reaction coordinates at the same time. The construction of the tethered 
ensemble in such a study presents no difficulties. We start by coupling reaction 
coordinates x,, i = l..n, with N demons each, 

Xi = Nxi = Xi + /?i , . . . , Xn = NXn=Xn + Rn- (4-29) 

We then follow the same steps of the previous section, with the consequence 
that the tethered magnetic field is now a conservative field computed from an 
n-dimensional potential D]v(x) 

VOn = (9x1 ^N, • • • ' 9x„^n) = B (4.30) 
B= ((6i)i,...,(6„)A), (4.31) 

and each of the 6, is of the same form as in the case with only one tethered variable. 



Similarly, the tethered weight of Eq. (4.17 1 is now, up to irrelevant constant factors. 



^n\^' M) ^ e ^^7{xi,Xi{{Sx})7{x2,X2{{Sx})) ■ ■ ■ 7{^n,Xn{{Sx}))- (4-32) 

where 

7(x;x) =e^(^-*)(f-x)(^-2)/20(x-x). (4.33) 



4.1.2 The Gaussian demons 

In the construction of the tethered ensemble we defined the bath of Gaussian 
demons as 

i 

adding the rjj quadratically to the spins. However, we could also have considered 
linear demons, for instance, 

R^"^^ = YLVr- (4-34) 



The whole construction could be followed in the same way, but we would have 

4^ 



b^^^ = m-m{{sx}). (4.36) 
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Furthermore, we have assumed there are as many demons as spins. While this 
choice seems natural, it is by no means a necessity In fact, if we were to consider 
an off-lattice system, the demons would increase the dispersion in the already con- 
tinuous pi(x). In order to control the fluctuations it is useful in such a case to 
consider a variable number of demons (see [|FERiiA j | ). Using the linear Z?^^-* we 
would have 

■y aN 



^« = - E ^i' (4-37) 

i=l 

= a(m - m). (4.38) 



4.2 

Ensemble equivalence 

In this chapter we have constructed the tethered ensemble, noting its relation to 
the canonical one. In particular, we have showed how to reconstruct canonical 
expectation values from the tethered averages as a function of rh. However, this 
is not always the ultimate goal. Throughout this dissertation we shall see several 
cases where we obtain physically relevant results without considering canonical 
averages. Good examples are the computation of the hyperscaling violations expo- 
nent 9 for the DAFF in Chapter |8] or the computation of the critical exponents ratio 
/3 / v in Chapters [5] and [6j 

Still, most of the time the averages in the canonical ensemble are the ones with 
an easiest physical interpretation (fixed temperature, fixed applied field, etc.). In 
principle, their computation implies reconstructing the whole effective potential 



Djv and using it to integrate over the whole coordinate space, as in (4.23). Some- 
times, however, the connection between the tethered and canonical ensembles is 
easier to make. Let us return to our ferromagnetic example, with a single tethered 
quantity m. Recalling the expression of the canonical partition function in terms of 
On for finite h, Eq. ( 4.241 , we see that the integral is clearly going to be dominated 



by a saddle point such that 

dm[^N{m) - ^hfh] =0 =^ {b)^ = p,h. (4.39) 

Clearly, this saddle point, the minimum of the effective potential, rapidly grows in 
importance with the system size N, to the point that we can write 

lim (0)(/z) = lim (0)^(;j). (4.40) 

That is, in the thermodynamical limit we can identify the canonical average for 
a given applied magnetic field h. with the tethered average computed at the sad- 
dle point defined by h, which is nothing more than the point where the tethered 
magnetic field coincides with the applied magnetic field (times p). 
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This ensemble equivalence property would be little more than a curiosity if it 
were not for the fact that the convergence is actually very fast (see Chapter [5] for 
a study). Therefore, in many practical applications the equivalence (4.40) can be 
made even for finite lattices (see Section 8.1.1 below for an example). 

The saddle-point approach can be applied to systems with spontaneous sym- 
metry breaking. In the typical analysis, one has to perform first a large-N limit and 
then a small-field one (this is troublesome for numerical work, where one usually 
has to consider non-analytical observables such as \m\). In the tethered ensemble 
we can implement this double limit in an elegant way by considering a restricted 
range in the reaction coordinate from the outset (see Chapter [5] for a straightfor- 
ward example in the Ising model and Section 8.1.1 for a more subtle one). 



CHAPTER V 



Tethered Monte Carlo study of the 
ferromagnetic Ising model 



This chapter presents a study of the D = 2 ferromagnetic Ising model carried out 
with the tethered formalism. This is the first system that we studied with TMC, 
as a demonstration of the method [ jFERogc] . It is intended as a step-by-step guide 
to TMC in a straightforward application as well as a demonstration of its power. 
We work in several regimes, covering most of the techniques that will be needed 
in a more sophisticated implementation (saddle-point equations, grid refinement, 
etc.). We also illustrate how TMC can provide complementary information and 
enable some new analysis techniques (see, for instance, the computation of /3/v in 



Section 5.2.21. 



The two-dimensional Ising model was solved by Onsager in 1944 joNS44] . Since 
then, many other exact results have been obtained (see | [MCC73i l for a review), mak- 
ing it the best understood model with a phase transition. Furthermore, the Ising 
model is the ideal setting for sophisticated Monte Carlo methods, particularly clus- 
ter algorithms (cf. Chapter |6]). Therefore, we can confront our results with accurate 
numerical computations even in those cases where the exact solution is unknown. 



5-1 



The model and observables 

We consider the two-dimensional Ising model, introduced in Chapter 4[ 
Z{h)=e''W) = J]exp[/3 £ s.Sy + fihl^^s 



(4-i> 



^As in Chapter Q we do not write explicit j6 dependencies, since we always work at constant 
temperature. 
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The infinite-volume system undergoes a second-order phase transition at a critical 
(inverse) temperature jSc 



log(l + x/2 



0.440 686 793509 771 



(5-1) 



The main observables we consider are the energy U and the magnetisation M, both 
defined on Eq. (4.2). Other interesting physical quantities are the specific heat C 
and the magnetic susceptibility Xi 



C = N[{u^ 
X2 = N[{m^ 



(m) 



(5-2) 
(5-3) 



The latter quantity can be generalised to define the higher cumulants of the 
magnetisation. 



X2n 



1 a2"iogz 



1 a2«f- 



N 



(5-4) 



h=0 



These can also be defined as the zero-momentum components of the 2n-potnt cor- 
relation functions. In particular, the two-point propagator is 



N 



(5-5) 



In the limit when A: — > 0, this function can be approximated by its free-field 
form jzj03} |XMi03j | 



G2{k) 



A 



D 



= 4:J^sin^ki/2. 

i=l 



(5-6) 



We can use the correlation function in position space to define a correlation length, 

— \x\ 



lim 



(5-7) 



'^''^ |x|^-oo logG2(x) 

In a finite lattice this definition is not practical but we can use the free-field approx- 
imation (5.6) to arrive at more easily measured ones. In particular, definition (5.7) 
would give ^exp = so we define 



Gijki) - G2{k2) 

KlG2{k2)-KlG2iki] 



(5-8) 



We can choose ki = (0,0,0) and k2 = k 

unit vectors in reciprocal space. With this choice, k^^ is the smallest non-zero 



{2n/L)ui, where is one of the 

(0 

min 
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momentum compatible with our periodic boundary conditions. We thus arrive at 
the second-moment correlation length ^2, 



^2 



2sm{n/L) 



X2 



1 



1/2 



D 



LG2{k^, (5-9) 



D 



i=l 



(remember that G2(0) = X2)- This definition of the correlation length was in- 
troduced in [|coo82 | | and has since then proved very useful in finite-size scaling 
studies |CAR93A[ |bal96[ [BALg/] . 

In the broken symmetry phase (or with non-zero h), definition (5.9 1 does not 
work and we have to use larger momenta (see Section 5.3). 

A final interesting observable is the Binder ratio, related to the fourth cumulant 
of the magnetisation. 



B 



[nr 



[m 



2\2 • 



(5.10) 



For the infinite model, the energy and specific heat can be shown to be (see. 



e.g., |hua87| | for a summary of the computation) 



u~(/3) 
C~(^) 



coth(2j6) 



1 + -k%(k) 
n 



^(/3coth2/3)2 [2Ki(k) - 2Ei(k) - (1 - k') + k%(k: 



(5-") 
(5-12) 



Here Ei(x) and Ki{x) are the complete elliptic integrals of the first and second 
kind (see, e.g., [IgraooI ) and k = 2sinh(2|6)/ cosh^(2|6), k' = 2tanh^(2|6) - 1. In 
addition, C. N. Yang IYAN32] showed that the spontaneous magnetisation is 



1 - (sinh(2|6))' 



1/8 



i6<i6c, 
. /3>/3c. 



(5-13) 



We shall also compare our results with the exact expressions for finite L computed 
by Ferdinand and Fisher in |fer69[ (too long to reproduce here). 



5-2 

Results at jS^, /z = 

We have used the TMC method, with the simple Metropolis implementation de- 
scribed in Chapter to simulate lattices L = 16,32,. . .,1024 at the critical point 
of the D = 2 ferromagnetic Istng model. Since the tethered formalism was al- 
ready constructed using the Istng model as an example, no further notes on it are 
needed here. However, we still have some practical decisions to make, regarding 
the numerical implementation: 
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Figure 5.1: Probability density function p{m) for several lattice sizes at the critical temper- 
ature. The peaks get closer as L grows, eventually merging in the thermodynamical limit. 



• We need to decide on a numerical interpolation scheme for the (O),^ and on 
a numerical integration method to obtain il^ from {h)^- 

• We need to decide which values of m to simulate. 

These issues are discussed in detail on Appendix |C| which also contains some 
practical recipes for the numerical implementation of the Metropolis algorithm and 
for the analysis of some delicate physical observables. Suffice it to say here that the 
issues of interpolation and integration are not critical (all the tethered averages 
being very smooth functions of m). 

As to the m sampling, it is best done starting from a uniform grid and, after 
a first analysis, perhaps refining the areas that are seen to have more weight in 
the p{m). In our case, for systems with L < 256 we have sampled the p{m) (see 
Figure 5.1 1 with 51 uniformly distributed points in the interval m G [—0.5,1.5] 
(except for L = 16, where the range had to be widened to avoid cutoff errors). For 
larger lattices the probability density drops much faster at the tails, so we have 
narrowed the range to m G [—0.3, 1.3] and, since the peaks are also steeper, we have 
then added another 26 points around the probability maxima (see Section C.2[ >. 

In all cases we have performed 10"^ MCS for each value of rn and computed 
the errors with 100 jackknife blocks (see Appendix |B] for our error estimation tech- 
niques), after discarding the first fifth of the measurements for thermalisation (al- 



though the correlation times are much smaller, see Section 5.5). 

In the following subsection we report the result of computing the canonical 
expectation values for several standard observables for zero magnetic field. We 
then demonstrate the new options afforded to us by TMC, by carrying out an 
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L 


- (u) 
\ / 






c 


B 


16 (TMC) 
16 (CAW) 
16 (E) 


1.45308(4) 

1.4529(2) 

1.453065... 


0.54543(6) 
0.5451(3) 


0.9116(2) 
0.9104(9) 


7.7186(14) 

7.718(10) 

7.717134... 


1.16562(7) 
1.1659(3) 


32 (TMC) 
32 (CSW) 
32 (E) 


1.43369(4) 

1.43367(12) 

1.433659... 


0.45900(10) 
0.4591(2) 


0.9072(4) 
0.9078(9) 


9.509(3) 

9.493(13) 

9.509379... 


1.16723(14) 
1.1671(3) 


64 (TMC) 
64 (CSW) 
64 (E) 


1.42397(4) 
1.42390(6) 
1.423938... 


0.386 19(18) 
0.3860(2) 


0.9065(9) 
0.9056(10) 


11.285(6) 

11.293(17) 

11.288138... 


1.1675(3) 
1.1677(4) 


128 (TMC) 
128 (CSW) 
128 (E) 


1.41905(5) 
1.41906(4) 
1.419076... 


0.3244(3) 
0.32459(17) 


0.9040(18) 
0.9048(10) 


13.063(10) 

13.06(2) 

13.060079... 


1.1684(7) 
1.1677(4) 


256 (TMC) 
256 (CSW) 
256 (E) 


1.41663(5) 
1.41664(2) 
1.416645... 


0.2728(6) 
0.27286(14) 


0.904(4) 
0.9042(9) 


14.83(2) 
14.83(2) 
14.828595... 


1.168 7(14) 
1.1682(4) 


512 (TMC) 
512 (CSW) 
512 (E) 


1.41542(4) 
1.415444(11) 

1.415429... 


0.2293(7) 
0.22968(13) 


0.903(6) 
0.9059(10) 


16.57(3) 
16.60(2) 
16.595404... 


1.168(2) 
1.1676(4) 


1024 (TMC) 
1024 (CSW) 
1024 (E) 


1.41489(4) 

1.414826(6) 

1.414821... 


0.1949(15) 
0.19307(12) 


0.919(15) 
0.9046(11) 


18.28(8) 
18.35(3) 
18.361 348. . . 


1.163(6) 
1.1681(4) 



Table 5.1: Results at the critical temperature and comparison with a cluster algorithm. 
(TMC): Tethered Monte Carlo, (CSW): Canonical Swendsen-Wang, (E): Exact results at finite 
L from [ |fer69( . 



unconventional, but very precise, computation of the anomalous dimension of the 
system from the position of the peak in p(m). In Section 5.3 we shall reweight these 
simulations to obtain canonical expectation values at non-zero magnetic field. 

5.2.1 Computation of the canonical expectation values 



Our first physical result is the pdf of m, represented in Figure 5.1 This distribution 
features a central minimum in the zero-magnetisation region (recall that /fi ~ m + 
1/2) as well as two symmetric peaks that get closer together as L grows. Of course, 
since this system experiences a second-order phase transition, the peaks should 
converge in the thermodynamical limit (where the effective potential is a convex 
function). 
In Table 



5-1 



we report our estimates for the canonical expectation values of sev- 
eral standard observables. We check our results against the values obtained from 
the exact formulas in |fer69[ and against a canonical Swendsen-Wang simulation. 
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We use our own implementation, based on the one distributed with IAM103] , al- 
though our results are compatible with those of [iSALOoil . We perform 10'' Swendsen- 
Wang cluster updates. 

From Table 5.1 we can confirm that a simple implementation of TMC is capable 
of producing very accurate results. The relative errors for X2 arid B scale as L. This 
can be explained by notic ing th at both are completely determined by p(m) (see the 
discussion following Eq. (c.ii I in Appendix Cl. In addition, b is self-averaging (cf. 
Section 2.2.2) and, as we shall see in Section 5.5 virtually free of critical slowing 
down (meaning that for a fixed simulation length its error scales as 1/\/N). Finally, 
in the computation of p(m) we are multiplying Djv(m) by a factor of N, yielding 
an overall \/ N scaling for the errors. 

Of course, TMC is not meant to be a competitor to cluster algorithms for the 
Ising model without magnetic field. For example, the CPU time to compute each 
of the 77 simulations at fixed m for L = 1024 is similar to what we needed for 
the whole Swendsen-Wang simulation, which is also more precise. See, however. 
Chapter |6] for a tethered implementation of the Swendsen-Wang algorithm, which 
turns out to be as efficient as the canonical Swendsen-Wang for the Ising model. In 
any case, our tethered version of the Metropolis algorithm is much more efficient 
than canonical Metropolis (see Section [5^. 



5.2.2 The magnetic critical exponent 

Let us now see an example of the new kind of analyses afforded to us by the 
tethered formalism. In particular, we shall compute the critical parameter j6/v 



(recall the definitions of the critical exponents in 2.1.2) in a very simple way that, 
however, would not be practical in a canonical simulation. 



We consider the finite-size scaling formula (recall Section 2.1.3) 



O) [h] = L-yofy foiL'/^t^LV'-h) + ... 



h ■ 



(5-14) 



We have included a second variable in the scaling function /q to allow for dis- 
placements in the applied field h, not only in temperature. These are regulated by 
a new critical exponent (analogous io yt = 1 / v). The dots represent possible 
corrections to scaling. For the moment, we shall work with no field, h = 0. 

Applied to the magnetisation, whose associated critical exponent is /3, the FSS 
ansatz implies that 

pi(m,/3c;L) = L/^/7(L/5/-m), (5.15) 

where p is a smooth version of pi{m; L). Now, the pdf of m, p(m; L) — Eq. ( 4.13I ) — 
is precisely a smooth version ofpi(m — 1/2). Therefore, we can write 



p(m,^c;L) = L^/7(L^/^(m-l/2)). 



(5.16) 



If we concentrate on the peaks of the pdf, we see that their height is going to grow 
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- - 1 
m 



peak ~ '"peak 2 '"peak ~ '"peak 2 



32 0.76401(10) 0.76431(11) 

64 0.70286(18) 0.703 0(2) 

128 0.6453(3) 0.6451(4) 

256 0.5921(7) 0.5910(7) 

512 0.541 9(12) 0.542 7(9) 

1024 0.499(2) 0.500(2) 

Table 5.2: Position of the positive and negative maxima of the p{m). Since we are at the 
critical temperature of a second-order phase transition, both columns should extrapolate to 
zero in the large-L limit (cf. our study of the ferromagnetic region in Section [54j |. 



as L^^^, while their position is going to shift as 

l^peak-^l-^^"^'^- (5-17) 

Now, computing the maximum of a numerical function is usually a delicate oper- 
ation. However, in our case we have measured directly b, which is the derivative 
of (the logarithm of) p{rh). Herein lies our advantage with respect to a canonical 
simulation (which cannot access b directly), because we only have to find a zero, 
a much better conditioned operation. In particular, we run through our computed 
tethered averages (&)^. until we find two consecutive points in the grid such that 
(b)^. > and {b)yni+i ^ ^- Then, we find the single zero of the section of our cubic 
spline (see Appendix |C]) that joins both points. This operation is performed for 
each jackknife block in order to estimate the statistical error (cf. Appendix |B]). The 



resulting maxima are collected in Table 5.2 



Notice that if we just wanted to obtain the maxima very precisely, we would not 
have performed a simulation of the whole m range. Instead, we would have done 
a fast sweep to place the peaks approximately and then we would have simulated 
only a few points in their neighbourhood, to a very high precision. However, even 
with our suboptimal grid we have determined the position of the maxima with a 
relative ranging from ^ lO"** for L = 3 2 to ^ 5 x 10""^ for L = 1024. 



We have fitted the J^pg-^j, to (5. 17 1, for different fitting ranges L > L^in (Ta- 
ble 5.3 1. The fits are good (in the sense of an acceptable / d.o.f., see Appendix |b]) 
for Ljnin > 64. Still, there are possible systematic sources of error (corrections to 
scaling), so, following | |bal96| |, we give as our final estimate the fit for Lj^m = 64, 
but with the larger error of the fit for = 128. For the negative magnetisa- 

tion peak we find ^/v = 0.1239(11) and for the positive peak ^/v = 0.1245(10). 
Both are compatible with the exact value for the D = 2 Ising model, known to be 
^/v = 1/8. 

Since we have a wide range of lattice sizes, we can try to characterise the first 
corrections to scaling. Following [|saloo1 , we assume that in the D = 2 Ising model 
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"^peak 










X^/d.oi. 




X^/d.oi. 


32 


0.1217(3) 


23.44/4 


0.1224(3) 


27.85/4 


64 


0.1239(5) 


2.027/3 


0.1245(5) 


2.087/3 


128 


0.1250(11) 


0.7569/2 


0.1246(10) 


2.053/2 


256 


0.126(4) 


0.6456/1 


0.1220(23) 


0.3248/1 



Table 5.3: Fits of w^peak(^) '5-^7 ' order to find ^/v, for different fitting ranges 
^ > i-min- For each fit we give the cni-square estimator and the degrees of freedom (see 
AppendixlBl. Our results converge to the exact value, fi/v = 0.125. 



the dominant corrections to scaling are analytical 

^peak = ^~^^'[^^ + A = 7/4. (5.18) 

We have fitted our points for all lattices to this expression, fixing the exponents to 
their exact values and varying A± and B±. We have obtained x^/d.o.f. = 0.9858/4 
for the negative peak and x^/ d.o.f. = 2.825/4 for the positive one. 



5-3 

Results at ^^,h^0 

One of the strengths of TMC is that it can compute the canonical expectation values 
for an arbitrary applied field h without any need for new simulations. Notice that 
in the canonical formalism not only would we have to run a new simulation for 
each value of h, but we would also lose the possibility of using the highly efficient 
cluster methods. One simply has to change the weight of the tethered expectation 



values when integrating over m, as indicated by Eq. (4.23 1. Figure 5.2 shows an 



example of this reweighting. We plot the pdf p(m;h) for several values of the 



applied field. By integrating the same tethered averages computed in Section 5.2 
with this new pdf we can obtain the canonical average at the corresponding applied 
magnetic field. 

Notice that now only a very narrow region has any significant contribution to 
the canonical average, which implies a loss of statistical precision. In general, if 
one is interested in the canonical average at a particular value of h, the m grid 
should be chosen so that the resulting peak is appropriately sampled, neglecting 
the exponentially suppressed region away from it. This is a simple task, because 
the position of the peak is easily determined from a first trial run, as we saw in 
the previous section. The only difference is that now we do not have to solve for 
{b)^ = 0, but for 

{b)nr = fih. (5.19) 
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Figure 5.2: Probability density function p(m,h) for L = 128 at j6c/ for several values of the 
applied magnetic field h. 



Once the peak has been accurately placed, the best strategy is running long tethered 
simulations precisely there and at neighbouring points. 

In this section, however, we are not interested in any particular value of h, but 
rather in studying the behaviour of the canonical averages {0){h) as smooth func- 
tions of the magnetic field. The simulations we already have will be sufficient, 
accepting a small loss of precision, because the m grid will not be necessarily op- 
timised. We can improve our precision somewhat, though, if we notice that most 
interesting observables are either odd or even functions of the applied magnetic 
field. Therefore, we can (anti)symmetrise the curves: 



For statistically independent data, averaging two equivalent estimates in this way 
would yield an error reduction of 1 / y/2. But this is not the case here: the individual 
averages for ±h are very strongly correlated. Therefore, the error reduction for 
even quantities is negligible. For odd quantities, on the other hand, since we are 
computing a difference, the fluctuations are greatly suppressed and the resulting 
error reduction is very large (around a factor of 10, specially for small values of h). 
We shall use equations (5.20I and (5.21), but dropping the explicit 'odd' or 'even' 
superscripts. 
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5.3.1 The magnetisation 

We first consider the most straightforward observable, the magnetisation {m){h). 
This is an odd function of h, so as discussed above we can greatly increase our 



precision by using formula (5.20). 



In this case, unlike previous sections, no data for {m){h) are readily available 
in a finite lattice, since canonical cluster methods lose much of their power in the 
presence of a magnetic field (in fact, current numerical methods for the investi- 
gation of the D = 2 Ising model in a field typically rely on transfer matrix tech- 
niques licASOoi |GRi03f |CAS04] ). Therefore, rather than compare our results directly 
with other computations, we shall carry out self-consistency checks. 



The first step is recalling Eq. (54), according to which the free energy of the 
system (an even function of h, for obvious symmetry considerations) can be written 
as a Taylor expansion in the following way 

00 

f^,(h)-f^,(0)= E^W"- (5-22) 
Therefore, the magnetisation (m) [h) is simply 

{m) ih) = ~=X2fih + |(/3^)3 + ^{fihf + + ... (5.23) 

Now, we can compute Xin the cumulants of the magnetisation ath = 0, but this 
equation provides an alternative way We generate a reasonable number of points 



of the (m) (h) curve, which we then parameterise with a truncated version of (5.23 1. 
The choice of values for h is somewhat delicate: if we use very small magnetic 
fields we will only be able to appreciate the first few coefficients but if we go too 
far in h we would need to have sampled the tails of the pdf of m very precisely 
We have found that magnetic fields up to fih ^ {X2)~^ provide a good compromise 
(notice that this value is L-dependent). 

As we discuss in Appendix |B| the correlation among the points in the curve has 
both a beneficial and a detrimental effect. The former is that, since all the points 
fluctuate more or less coherently, the resulting curves are very smooth. The latter 
is that these same correlations make it very difficult to estimate errors or to check 
whether a proposed fitting function is a good model for the data. In this section 
we have taken the approach of computing an odd interpolating polynomial with 
a finite-difference formula, which gives us as many xin as we have points. We 
estimate the statistical errors with the jackknife method. We still face a systematic 
error, which we try to control by varying both the range in h and the number of 
points. We have found that the last one or two coefficients in the fit are typically 
unstable. Therefore, if we want to obtain n physically meaningful parameters, we 
should compute at least n + 2 points. In our case, we have computed the non- 
linear susceptibilities up to X8> so, to be safe, we have used 7 points for each lattice 
size. These were equally spaced at intervals of A(/3/z) = {10x2)~^> where Xi is the 
susceptibility computed in the simulation at /z = 0. 



5-3 — Results at /3c, 7^ 



73 



L 


N-'X2 






N-'X8 


16 (M) 


0.54543(6) 


-0.545 72(13) 


2.265 7(8) 


-20.059(10) 


16 (F) 


0.54543(6) 


-0.545 7(2) 


2.262 8(19) 


-19.70(14) 


32 (M) 


0.45900(10) 


-0.3861(2) 


1.348 5(10) 


-10.042(10) 


32 (F) 


0.45900(10) 


-0.386 2(3) 


1.3484(18) 


-10.00(2) 


64 (M) 


0.38619(18) 


-0.273 3(3) 


0.803 1(13) 


-5.032(11) 


64 (F) 


0.38619(18) 


-0.273 8(5) 


0.805(2) 


-5.02(2) 


128 (M) 


0.3244(3) 


-0.192 8(4) 


0.475 8(17) 


-2.504(12) 


128 (F) 


0.3244(3) 


-0.1943(7) 


0.481(3) 


-2.52(2) 


256 (M) 


0.2728(6) 


-0.136 2(7) 


0.283(2) 


-1.250(12) 


256 (F) 


0.2725(6) 


-0.135 8(15) 


0.280(6) 


-1.20(4) 


512 (M) 


0.2293(7) 


-0.0964(7) 


0.168(2) 


-0.625(9) 


512 (F) 


0.2293(7) 


-0.096 0(12) 


0.166(4) 


-0.60(2) 


1024 (M) 


0.1949(15) 


-0.069 8(13) 


0.104(3) 


-0.328(12) 


1024 (F) 


0.194(3) 


-0.063(6) 


0.08(2) 


-0.2(3) 



Table 5.4: Non-linear susceptibilities from direct measurements at h = (M) and from a 
finite-difference formula for {m){h) as a function of the magnetic field (F). 



In Table 5.4 we compare the non-linear susceptibilities computed from mea- 
surements of the cumulants aih = Q and from the {m){h) curve. Both series are 
compatible, but the former are more precise. 

We can also perform a FSS analysis. For small applied fields we can collapse the 
curves for different sizes by plotting {m){h)/xi against ^h. As h grows, however, 
we start to appreciate the deviations from linear behaviour computed above (Fig- 
ure |5.3| — left). We can attempt a better collapse of the curves for different L with 



the FSS formula (remember Section 5.2.2) 



(m)(7z) ~ L-f^'^fmiiy^h), = 15/8. (5.24) 

We have plotted {m)L^^^ against hLV^' on Figure [5 .3] — right. We also include a 
fit for the scaling function fm- We use an odd seventh-degree polynomial and 
include in the fit the points for L > 64. The value of the diagonal ^^^^ 
X^/ d.o.L = 41.85/31 (see Appendix |b] for definitions). The last point in the curve 
starts to show a deviation, probably due to corrections to leading order scaling. 
Remember that we spaced our values of fih in units of {10x2)~^> which is not an 
optimal choice for a FSS study. 

We complete our analysis of {m)(h) by considering now L-independent mag- 



netic fields across several orders of magnitude (Figure 5.4). We observe two well- 
differentiated scales: a FSS regime, where the slope of the curve is very large, and 
a saturation regime where the curves merge. Notice that the only hard limit in the 



values of h we can consider is given by the saddle-point equation ( 5.i9[ >. So long 
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1= 512 
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Figure 5.3: Magnetisation as function of the applied magnetic field. In the left panel 
we have scaled the graphs by the linear susceptibility at zero field, Xii^)/ ^rid appreciate 
deviations for large fields. In the right panel we show a scaling plot, with a much better 
collapse. The continuous line is a fit for the scaling function /„„ Eq. ( 5.24I , using a seventh 
degree polynomial, with x^/d.oi = 41.85/31. 




Figure 5.4: Magnetisation as a function of the applied field for several lattices at j6c. We 
use an L-independent range for h (notice the logarithmic scale of the horizontal axis). 
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1 = 64 
1= 128 
1= 256 
L= 512 
L= 1024 
Scaling curve 




Figure 5.5: Correlation length ^'{h)/L, Eq. (5.25 1, against the scaling variable ^JiLy''. We 
also plot a fit to the scaling function f^t of ( 5.26[ l. 



as is contained within the bounds set by our measured values of the peak 
in p{in;h) will be contained in our simulated range and we can interpolate it with 
some precision (this is the reason for the rapid error growth in our largest lattices). 
Notice that the displacement of the curves in the FSS section seems linear in log L. 
This is an artifact of the low value of/3/v = l/8in Eq. (5.24 1. 



5.3.2 The correlation length 

We consider now the correlation length of the system for non-zero applied field. 



As we discussed in Section 5.1 the second-moment definition (5.9 1 does not work 



in an applied field, so we have to go back to the generic formula (5.8) and choose 
different ki and k2- The simplest choice is to use the next smallest momenta: 
^\ = kmin and ^2 = 2n/L{\,±l), 



2sin(7t/L) 



L2G2(/C2)-G2(/C 

r 



1/2 



(5-25) 



Now, ^'(/z) is an even function of h, so we symmetrise it with (5.21 1. Again, due to 
the lack of readily available data in a finite lattice for this observable, we check our 
results with a FSS analysis. To leading order, we have 



(5.26) 



where, as in the case of m, /^/ is expected to be very smooth. As we can see in 
Fig. 5.5 Eq. (5.261 is perfectly valid in our case, if we discard the data for L < 
32. At a first glance, it may seem that we have even overestimated our errors 
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for L = 1024, but remember that the points are very strongly correlated. The 
universal scaling curve is well represented by a sixth order even polynomial, with 
x\/ d.o.f. = 3.978/36. Thus, the universal scaling function f^^^^x) for x < 1.5 is very 
well approximated by 

f^i (x) = flo + ^ix^ + Hx'^ + a^x^, (5-27) 

with 

ao = 0.2399(2), ^2 = -0.0639(4), ^4 = 0.0235(7), ^6 = -0.0045(4). (5.28) 

We have estimated the errors in the fit parameters with the techniques of Ap- 
pendix |B] 



54 

The ordered phase: spontaneous symmetry breaking 

In this section we present new simulations of the D = 2 Ising model in the ferro- 
magnetic phase, using them to illustrate the treatment of spontaneous symmetry 
breaking in the tethered formalism. We then examine the approach to the ther- 
modynamical limit, studying the equivalence between the canonical and tethered 
ensembles. 



54.1 Spontaneous symmetry breaking 

Let us now consider the /3 > /3c regime. In this situation, the infinite system shows 
a nonzero expectation value for the order parameter, (m) 7^ 0, even in the absence 



of an external magnetic field, Eq. (5.13 1. This may seem incompatible with the 



partition function (4.1), where the configurations {s^} and {— s^} occur with equal 



probability. The well-known solution for this apparent paradox is spontaneous 
symmetry breaking, see, e.g., | hua87[ |zj03j , whose mathematical formulation in- 
volves considering a small magnetic field (which establishes a preferred direction) 
and taking the double limit 



[m 



lim lim {m 



XL) 



(h). 



(5-29) 



The order of the two limits is crucial: were we to reverse it, the magnetisation 
would always vanish. We see then that the symmetry of our model complicates the 
definition of a broken symmetry phase for finite lattices in the canonical ensemble. 
The traditional workaround consists in considering not the magnetisation m, but 
its absolute value \m\. 

The tethered ensemble provides a cleaner concept of broken symmetry phase. 



Consider the pdf of w, as in Fig. 5.1 In the ferromagnetic phase the corresponding 
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Figure 5.6: Computation at /3 = 0.4473 > f^c (ferromagnetic phase). The peak of the pdf of 
m gets narrower and closer to my + 1 /2 as L increases (the vertical line, where my is Yang's 
magnetisation for the infinite system at /3 = 0.4473). Compare the scale of the OX axis and 
the height of the peaks with those of Fig. 5.1 



L 


^points 


Am 


-("> 


C 




{m) 


128 (T) 
128 (E) 


90 


0.9 


1.490397(18) 
1.490409 763... 


8.874(4) 
8.877363... 


10.394(17) 


0.71934(6) 


256 (T) 
256 (E) 


79 


0.39 


1.490407(11) 
1.490415672... 


8.869(5) 
8.874075... 


11.26(4) 


0.71941(4) 


512 (T) 
512 (E) 


27 


0.13 


1.490419(5) 
1.490415689... 


8.877(5) 
8.874 046... 


11.5(3) 


0.71945(3) 


1024 (T) 
1024 (E) 


27 


0.13 


1.490416(4) 
1.490415689... 


8.868(7) 
8.874 046... 


11.4(18) 


0.71945(2) 


~(E) 






1.490415689... 


8.874046... 




0.719436... 



Table 5.5: Canonical averages for several physical quantities of an Ising lattice at j6 = 0.4473 
computed with the tethered method (T). The grid of m values is uniform in the narrow 
simulated band. Also included are the exact results for finite lattices from [FER69I and the 
exact results in the thermodynamical limit from Eqs. (3.11 - 3I13I 1. We appreciate that by 
simulating only a very small range Am of values for m we can obtain very precise values. 
Within our error, we have already reached the thermodynamical limit for L = 512. 
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graph will again have two peaks, but now these will be much narrower and higher, 
approaching two Dirac deltas in thermodynamic limit. Suppose we want to per- 
form the double limit of equation ([5^29]). This would involve introducing a small 
magnetic field which would shift the origin of h (4.23 1. The neighbourhood of one 
of the peaks would then become exponentially suppressed and eventually disap- 
pear in the thermodynamic al limit. Thus, we can mimic the effect of equation (5.29) 
by considering only one of the two peaks from the outset. This would not work 
at or below /3c, as there the peaks extrapolate to m = (recall Section 5.2.2). This 
procedure has the considerable advantage that it works for any lattice size. In this 
section we have chosen the peak of positive magnetisation. 

We have run simulations for lattice sizes L = 128, 256, 512, 1024 at inverse tem- 
perature /3 = 0.4473 > j6c. We chose this temperature as we estimated that the 
correlation length would be around ^' ~ lOj^ 

Following the previous discussion, we have worked in the m > 0.5 (positive 
magnetisation) region, where there is only one peak. An appropriate sampling of 
m is even more important in this phase (but easier to optimise) than in the situation 
described in detail in Section C.2 The reason is that the peak is now so narrow that 
a choice of m spaced as in the aforementioned section would not only be completely 
wasteful, but may also completely fail to sample the peak (remember the loss of 
precision for very high magnetic fields in Section [53] >- 

In the case of the Ising model, we know Yang's exact solution m°°(/3) for the 
magnetisation of the infinite system, Eq. (5.13 ). The positive peak for p{m) will then 
be very close to m°°(/3) + | and get closer as we increase L. With this information in 
hand, we can adequately reconstruct the effective potential by running simulations 
in a small neighbourhood of m°°(^) + j. For a different model, where we would 
lack the knowledge of the peak's position in the thermodynamic al limit, we can just 
run simulations with a very fine grid for some small and essentially costless lattice 
size and infer from them an efficient distribution of points for the larger systems. 

We have represented p{m) for all the simulated lattices in Fig. 5.6 which plots 
the whole simulated range of m for L > 512 (for the smaller lattices we have used 
a somewhat larger interval). It is interesting to compare the scale on the axes with 
that of Fig. 5.1 As will be discussed in detail in Section 5.4.2 the peak approaches 

Table 



m 



'(/3) + 2 (the vertical line) as L increases. 



5-5 



compares the values of the 



energy and specific heat obtained in our simulations with the exact values given 
in | |fer69[ . Notice how very small simulated ranges of m (Am = rhmax — ^min) yield 
very accurate results. In fact, we can see that for the L = 1024 lattice we obtain a 
more precise determination for the energy with 27 points than what we obtained 
at the critical temperature with 77 (we still perform lO'' Monte Carlo sweeps in 
each point). This result is even more impressive if we consider that some of these 
27 points, being deeply inside the tails of the distribution, do not have any effect 
whatsoever in the average with our error (of course, we do not know this until we 
have run the simulation and seen the actual width of the peak). 



^In this section we again have to use Eq. 1 5.25 1, instead of ^2, Eq. 1 5.9 1. 
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From Table 5.5 we can conclude that the thermodynamical limit has already 
been reached for L = 256, at least to the level indicated by our errors. Our whole 
computation for L = 512 required about 270 hours of computer time. For com- 
parison, a Swendsen-Wang computation with a run time of 30 hours for L = 512 
gives ^' = 11.8(2). We see that the ratio of computation time for both methods has 
changed significantly from the critical point, where the advantage of the cluster 
algorithm was much greater. 



54.2 Ensemble equivalence 

Once we wander away from the critical point, the main goal is finding the value of 
physical quantities in the thermodynamical limit, rather than attempting a finite- 



size scaling. The ensemble equivalence property discussed in Section 4.2 suggests 



a way to reach this limit without constructing the whole canonical p{m), but by 
concentrating instead on its maximum. From the computational point of view, this 
supposes a dramatic reduction in the needed effort for a TMC simulation. 
Ensemble equivalence can be expressed in mathematical terms by 



lim 



io) 



lim 



io) 



^peak ' 



(5-30) 



where rhpeak is the saddle point of Section 4.2 (for zero field, in this case). This 



equation can be understood as a more formal way of summarising the behaviour 



of Fig. 5.6 Indeed, we saw in the previous section that we could reconstruct the 
canonical averages considering only a very narrow range of m; in the thermody- 
namic limit a single point would be sufficient. 

For the Ising model we know exactly where this point would be situated, be- 



cause, from Yang's spontaneous magnetisation (5.13 1 



m~(/3) = lim (m) + 1 



1 - (sinh2|6)' 



1/8 1 
+ 2- 



(5-31) 



We could then run simulations for several lattice sizes precisely at m°° and study 
the evolution of (0)^^00 as we increase L. This is not the most practical approach, as 
for a model other than the D = 2 Ising lattice we would not know the position of 
the peak beforehand. Instead, we will follow a more general analysis that would 
work in more complex situations. 

Let us consider the canonical average of some quantity and recall that we are 
using periodic boundary conditions, so the approach to the thermodynamical limit 
is exponential 



/■oo 

(0)= / dmp(m,/3;L)(0),^ = 0~ + Aoe- 

Jl/2 



(5-32) 



where Aq is a constant amplitude. We have just considered the positive magneti- 
sation peak. 
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L 




j6 = 0.4473 




&~ ■ 10^ 


^peak 


"peak 


16 


2984(3) 


1.34384(4) 


1.57707(8) 


32 


924.7(16) 


1.29930(7) 


1.52898(9) 


64 


284.9(9) 


1.26333(7) 


1.50548(5) 


128 


86.0(6) 


1.23988(9) 


1.49563(4) 


256 


25.5(3) 


1.22732(8) 


1.491 99(2) 


512 


6.9(2) 


1.22204(6) 


1.490859(16) 


1024 


2.11(11) 


1.22024(4) 


1.490538(9) 


00 





1.219435... 


1.490416... 



L 




j6 = 0.6 






&~ ■ 10^ 


^iak 


"peak 


16 
32 
64 
128 
256 
512 
1024 


-290.6(14) 
-52.5(8) 
-11.0(4) 
-2.45(19) 
-0.67(11) 
-0.19(5) 
-0.03(2) 


1.471943(7) 
1.473 299(5) 
1.473 543(2) 
1.473 5940(12) 
1.473 6047(2) 
1.473 6077(4) 
1.473 608 3(2) 


1.91298(4) 

1.91018(2) 

1.909374(9) 

1.909165(5) 

1.909107(3) 

1.909090 7(15) 

1.909086 7(4) 


00 





1.473 608 7... 


1.9090862... 



Table 5.6: Tethered mean values of several parameters at the peak of the probability density 
function for /3 = 0.4473 and /3 = 0.6, together with the value of &~ = (this observable 

is zero at the peak and helps characterise how close we are to it). The exact value for an 
infinite lattice, which coincides with the canonical average, is also included for comparison. 



The integral will be dominated by a saddle point at J^ipg^^j,, with Jfipg^j^ ^^°°> m°°. 



so we can approximate the pdf by a Gaussian 

p(m,j6;L) 



N 



27tA;2 



exp 



N{m - mpeak) 



2X2 



(5-33) 



Therefore, we exp ect th e tethered average of O at this saddle point to approach the 
canonical average (5.32), with a correction of order N~^, 



(O)-Aoe-^/f- = (O)„,.^^ + 0(L 



-D 



(5-34) 



To ease the notation we shall use the definition 



(5-35) 
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This simple analysis provides a practical way of approaching the thermodynamic 
limit without knowing the limiting position of the peak in advance. 

We first run a complete simulation for some small lattice, covering the whole 
range of m. This provides a first approximation to the position of the peak. For 
growing lattices, we just compute two or three points at both sides of where we 
think the maximum is going to be. Our objective is not to reconstruct the whole 
peak of p{m), just to find a good approximation to the point Jfipg^^, where 



vanishes. We use the same procedure as in Section 5.2.2 finding the zero of the 
cubic spline and interpolating the physical observables. Actually, if the position of 
the peak is sufficiently bounded we could just place one point very closely at either 
side and use a linear interpolation. 

With this procedure we are able to compute the tethered mean values of the rel- 
evant physical quantities at the peak with a minimum of numerical effort. Here we 
shall apply this method to the energy and we shall also characterise the approach 
of the peak to m~. To the latter purpose, we have computed = (fc)^co for several 
lattice sizes and studied how fast it approaches zero. We also give the values for 
the position of the peak (Table 5^. 



Following the above analysis we should find that 

l"peak W I - / 



I + ^ CO I 



Ani-L (5.36) 



with ^ ~ 2. We present in Table 5.7 the result of applying the quotients method 
(see Section 8.5 1 to these observables. As we can see, our results are always < 2, 
even though this exponent grows with L. 

We believe this was caused by the proximity of the critical point, so we ran anal- 
ogous simulations for j6 = 0.6. We were able to complete this new computations 
in very little time, following the above procedure. For example, for L = 512 the 
position of the peak was so tightly bounded that we just computed one point at 
either side. 



Comparing Table 5.6 with Table 5.1 we see that for j6 = 0.6, with a computation 
effort almost 40 times smaller, we have obtained a result an order of magnitude 
more precise than what we had at fic- Recomputing the effective exponents for 
these new simulations we obtain results compatible with ^ = 2. Notice that for this 
temperature the error in the exponents is much bigger than that for j6 = 0.4473. The 



reason is clear from Table 5.6 The left-hand sides of (3^3^ are now much closer to 



zero than in /3 = 0.4473, yet their errors are only slightly smaller. In the computa- 
tion of the effective exponents only the relative errors matter, which explains our 
bigger uncertainties. Notice, however, that we have been able to distinguish val- 
ues for b°° of order 10~^ from zero and that we have located the peak with seven 
significant figures. 
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L 




a n A All. 

p — U.'l^/O 






p — U.D 
















16 


1.690(3) 


0.6394(14) 


1.168(4) 


2.47(2) 


2.43(2) 


1.83(3) 


32 


1.699(5) 


0.864(3) 


1.356(6) 


2.25(5) 


2.24(5) 


1.93(5) 


64 


1.729(12) 


1.102(7) 


1.531(12) 


2.17(12) 


2.16(13) 


1.87(10) 


128 


1.75(2) 


1.375(16) 


1.73(3) 


1.9(3) 


1.89(13) 


1.9(2) 


256 


1.88(5) 


1.60(4) 


1.83(6) 


1.8(4) 


2.0(5) 


2.2(5) 


512 


1.71(9) 


1.70(8) 


1.85(12) 


2.7(12) 


1.4(10) 


3.1(12) 



Table 5.7: Rate at which several observables approach zero. We consider a functional 
form A ■ L^^ and compute the effective exponent ^ from the ratio of the computed values 
at consecutive lattice sizes. We consider three exponents, ^g, ^rf, and for the evolution 
of b"^, ?«pga]^ and u^g^^, respectively, see (5.361. We observe that for |S = 0.6 the effective 
exponent approaches 2, as expected fromi the discussion in the text, while for /S = 0.4473 
the proximity of the critical point complicates the analysis. 



5-5 

Numerical performance analysis 

We consider here the Metropolis update algorithm described in detail in Sec- 



tion 3.2.1 for a D = 2 Ising model at the critical temperature. Notice that, since 
this is a local update algorithm, one would expect the autocorrelation times (see 
Appendix [A| for definitions) to scale as 

T oc (5.37) 

with z ~ 2, due to critical slowing down jHOH77[ IZJ03] 



In Table 5.8 we show the integrated autocorrelation times for the tethered mag- 



netic field at the values of the smooth magnetisation m corresponding to the min- 



imum and one of the maxima in p(m) (recall Figure 5.1 1. As we can see, there is 
hardly any evolution (in fact, the values are so small that we cannot even measure 
them properly, since times smaller than one are meaningless in a sequential update 
scheme). The same situation is reproduced if we consider the autocorrelation times 
for other magnetic observables (functions of the magnetisation m). This result is 
surprising, specially considering that naive conserved order parameter dynamics 
exhibit z = 4: — t] critical slowing down |hoh77| . 

The key is that our method imposes the constraint globally so, even though the 
update algorithm is local, the magnetisation has global information. In this case. 



the new dynamical exponent should be Znoniocai = z — 2 ^ | |tam89[ , as is the case 



for our method. Notice that this result implies that Tethered Monte Carlo is able to 
reconstruct the effective potential O^^ without critical slowing down for the Ising 
model. 

Other physical observables, however, do not enjoy the same nonlocal informa- 
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L 


fh = 0.5 


m = 1.14 


32 


0.491(9) 


0.564(4) 


64 


0.548(11) 


0.614(8) 


128 


0.580(14) 


0.654(8) 


256 


0.597(13) 


0.641(6) 


512 


0.639(4) 


0.661(4) 


1024 


0.632(4) 


0.665(5) 



Table 5.8: Autocorrelation times for the tethered magnetic field (&)„-, at the minimum, 7h = 
0.5, and close to one of the maxima of the p(m) for a critical D = 2 Ising model, simulated 
with a Metropolis algorithm. There is no critical slowing down for this observable. The 
times appear to grow for small sizes, but this is an artificial effect of the discrete nature 
of Monte Carlo time (notice that t > 1 / 2 by construction and that times very close to this 
limit are difficult to measure). 




m = 1.14 



150 200 



t 



Figure 5.7: Normalised autocorrelation functions po{t,th) for the energy and the propa- 
gator G2(fcmin) at Tc in an L = 128 lattice. We plot them at the central minimum of p(m) 
(left) and at one of the peaks (right). In both cases, the correlation function for G2 is al- 
most a pure exponential. Notice, however, that the correlation functions for both quantities 
become parallel, indicating that they share the same exponential autocorrelation time. 
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m 



FiGURE 5.8: Exponential autocorrelation times for tethered simulations of the D = 2 Ising 
model at the critical point, as a function of rh. The Texp, computed from the correlation 
function of G2(fcmin)/ are represented in units of the system size, showing a z « 2 critical 
slowing down. 



tion as b and therefore show standard critical slowing down. This is the case, for 
instance, of the energy. The quantity with the slowest dynamics seems to be to be 



the two-point propagator G2, defined in (5.5), We have plotted in Figure 5.7 the 
temporal autocorrelation for the energy and for G2 in an L = 128 lattice, at the 
central probability minimum and at the probability peak. At both magnetisations, 
the correlation functions for the energy and propagator become parallel for long 
enough times. This indicates that they both have the same exponential time, which 
we also expect to be the exponential time of the system as a whole. In addition, the 
Pg^{t,m) is almost a pure exponential, so we can make the approximation 

Texp = Texp,G2 - Tint,G2 • (5-38) 



Figure 5.8 shows this exponential time as a function of m for several system 
sizes. As we can see, the central region does exhibit critical slowing down. No- 
tice, however, that even for an L = 1024 system Texp is only ^ 10^, so this simple 
Metropolis algorithm can be used safely to study very large systems. As a compari- 
son, the exponential autocorrelation time of the canonical version of the Metropolis 
algorithm for L = 64 is already as large as that of the tethered version for L = 1024 
(see, e.g., jAMi03| l). 



CHAPTER VI 



Optimising Tethered Monte Carlo: cluster 
methods 



In the previous chapter we showed how a straightforward implementation of the 
tethered formalism, using as Monte Carlo dynamics the simple Metropolis algo- 
rithm, can be very precise and efficient. Eventually, however, the system size we 
can consider in the critical regime is capped by the appearance of critical slowing 
down, according to which the thermalisation times grow as T ^ L^, with z ~ 2. 

However, this z ~ 2 is not a hard limit imposed by the tethered formalism, 
but is instead determined by the Metropolis algorithm we have used to explore 
the tethered ensemble. If we had run our tethered simulations with some kind of 
optimised dynamics, the critical slowing down would have been reduced. In this 
chapter we consider the most spectacular example of optimised update algorithm: 
cluster methods. These are able to achieve z < 1, for all intents and purposes 
eliminating critical slowing down. Because of this, since their introduction in the 
1980s [| swe87[ |wol89[ , they have attracted a considerable interest, which lasts to 
the present day | cha98[ |deno7A[ |deno7B[ |DEN07q |DEN07Df idenioi igariiI . Despite 
this continued research effort, however, efficient implementations of cluster meth- 
ods remain restricted to very few situations. In particular, the development of a 
cluster update algorithm with conserved order parameter dynamics has long been 
considered somewhat of a challenge |jANN92|. 

The work reported in this chapter (whose results were originally published, 
in a reduced form, in |mmo9| ) shows how, using the tethered formalism, one can 
actually define a workable cluster update with a conserved order parameter. This 
is demonstrated in the context of the D = 2, 3 ferromagnetic Ising model, the ideal 
scenario for canonical cluster methods. Furthermore, the tethered version of the 
cluster update is shown to be just as efficient (in the sense of having the same z 
exponent) as the canonical one. 
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6.1 

The canonical Swendsen-Wang algorithm 

We summarise here the construction of the Swendsen-Wang algorithm, follow- 
ing ||edw88|| . The process is based on the Fortuin-Kasteleyn transformation [Ika.s69[ 
|FOR72i l, which maps Potts models onto bond-percolation problemsj^ We start by 
rewriting the partition function of the Ising model in the following way 

Z = E exp [/3 E 'r'y \ = E n [P^sx,s, + (1 - p)], (6.1) 



{sx} {x,y) 



{s4 

where 

p = 1 - e~2^. (6.2) 

We now notice that, trivially, 

[vK,Sy + (1 - p)] = E [V5sx,sy5n,l + (1 - p)5n,Q\ ■ (6.3) 

n=Q,\ 

Finally, we introduce this identity in the partition function by considering one aux- 
iliary variable n^y for each first-neighbours pair {x,y): 

Z = E E n [vK,Sy5nxy,l + (1 - V)KyA . (6.4) 
{sx} {nxy} {x,y) 

This partition function describes a model with (D + 1)N dynamic variables: the 
original N spins of the Ising model and the DN auxiliary variables nxy- We can 
think of the latter as bond-occupation variables. We say that a link xy is occupied if 
nxy = 1 and that it is empty if nxy = 0. Notice that the spins joined by an occupied 
bond must be aligned, while those joined by an empty bond can be either aligned 
or opposed. Therefore, the system has been partitioned into several clusters (two 
spins are in the same cluster if they can be connected by a path of occupied bonds). 

If we sum over the Sx for fixed bonds, we see that, while all spins in the same 
cluster must be aligned, the orientation of separate clusters is independent. Con- 
sider a bond configuration with I occupied bonds, defining Mq clusters (some 



of which may be single-spin clusters). We see in (6.4) that each occupied bond 
contributes a factor p to the weight and that each empty one contributes a fac- 
tor (1 — p). There are possible cluster orientations. Therefore, the marginal 
probability of the bonds is just 

cv{{nxy}) = Z-y{l - p)0^-^2-^c. (6.3) 

More interesting are the conditioned probabilities of the spins given the bonds 
and vice versa, which can be read directly from (64]), 



^The Potts model is a generalisation of the Ising model where the spins can take q different 
values, rather than just two. 
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(a) Given the {s^}, the bonds are independent from one another and nxy = 1 
with probability p&sx,sy and nxy = otherwise. 

(&) Given the {nxy}, all the spins within a cluster are aligned and two spins in 
different clusters are independent from each other. Each cluster has a positive 
or negative orientation with 50% probability. 

These conditioned probabilities can be used to define a Monte Carlo update 
scheme. Each lattice update consists of two steps: 

1. Given a starting spin configuration {sx}, the conditioned probability [a) is 
used to trace the clusters. This can be implemented in several ways, with an 
algorithmic complexity of 0(NlogN). See |jLAN05| or the code distributed 
with IAM103] for an example. 

2. Once the clusters are traced, they must be flipped to update the spin config- 
uration. The conditioned probability (&) is trivial, the sign of each cluster is 
chosen as ±1 with 50% probability. 

Each of these two steps satisfies the detailed balance condition. The combination 



of the two is irreducible and satisfies balance (see Section A.i for definitions). 

This scheme can be implemented for systems other than the Ising model (for 
instance, in an antiferromagnetic model, the clusters have uniform staggered mag- 
netisation), but it is for this system that it shows its full power. The critical slowing 
down is practically eliminated, with z < 1 in both two and three dimensions (see 
below), meaning that even for very large lattices a few updates are enough to 
decorrelate the system completely. 

6.1.1 Improved estimators 

Performing a single cluster update changes the spin configuration radically, almost 
instantaneously decorrelating the system. Therefore, not only is the system ther- 
malised very quickly, but one can also take measurements very frequently and 
obtain independent data (remember the discussion of integrated and exponential 
correlation times in Appendix A I. 

But the statistical gain of a cluster scheme does not end here. Indeed, in the 
Swendsen-Wang scheme, after tracing the clusters, we have to flip them randomly, 
and with equal probability for each orientation, in order to obtain our new system 
configuration. However, when taking measurements of some physical observables 
we can consider at once the average over all the possible cluster orientations 
with fixed {nxy}- We obtain in this way the so-called improved or cluster estima- 
tors |swe83[ Iwol88 ||. 



Let us call S, = ±1 the spin of the z-th cluster and its size. Then, the spin 
estimator for the magnetisation of the system is 

M = Y,Sx = YL^rS,. (6.6) 
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And the squared magnetisation is 



= Y^SxSy = J^ninjSiSj. 



(6.7) 



Let us denote by an overbar the average over the cluster configurations {S,}. 
Obviously, 

M = ^n,S, = 0. (6.8) 

i 

For the squared magnetisation, only the diagonal terms survive (remember the Sj 
are independent) 

M^ = E"?W = E4 (6-9) 

i i 

Obviously, (M^) = (M^), but the second estimator has more information and is 
therefore potentially more precise. We can do the same for other observables, for 
instance. 



M4 = 3En?n2-2Enf, 

i,i i 

= 15 E n}n]nl - 30 £ n]n^ + 16 £ nf , 

i,i,k i,i i 

M8 = 105 J] njnjnlnj - 420 J] njnjn^ + USj^^jn^j 

i,i,k,l i,j,k i,j 

+ 140 E nfn/- 272 J] nf. 
i,i i 



(6.10a) 
(6.10b) 



(6.10c) 



Consider now the propagator G2{k), Eq. (5.5 1, with spin estimator for a given con- 
figuratiorij^ 



G2{k) 



E 



Sxe 



ikx 



Y^SxSy 



(6.11) 



xy 



Now, when averaging over the cluster orientations at fixed {nxy}, two spins be- 
longing to different clusters are uncorrelated and two spins in the same cluster are 
equal, therefore 



G2{k) = E 

i=l 



xeC, 



ikx 



(6.12) 



The error reduction due to the adoption of cluster estimators is heavily dependent 
on the cluster distribution (see jFERogA] ), but it can be very large. See, for instance. 
Table 6.1 where we compute the renormalised coupling constants of the D = 
2 Ising model with several methods, and obtain a tenfold error reduction using 
cluster estimators. 



^As explained in Appendix [a) we use the Wiener-Khinchin theorem to write the Fourier Trans- 
form of the spatial correlation of Sx as the modulus of the Fourier Transform of Sx- 
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TMC 


SWS 


SWC 


-(«) 


1.226067(7) 


1.226 076(8) 




X2 


203.78(11) 


204.07(10) 


203.92(2) 




11.888(11) 


11.907(10) 


11.8932(10) 


re 


3.70(6) 


3.73(8) 


3.731(6) 


rs 


26.2(6) 


24(3) 


26.47(18) 


gi 


14.66(5) 


14.69(9) 


14.673(8) 




794(9) 


806(24) 


803.3(16) 




8.25(13) 


7.5(11) 


8.34(7) 


Table 6 


.1: Several observables for the D 


= 2 Ising model (the r, and gi are 


renormalised 



coupling constants, see licASOiil for definitions). We work at /5 = 0.42, with L = 100, 
and take 10^ Swendsen-Wang lattice updates. Compare the results with spin estimators 
(SWS) and with cluster estimators (SWC). We also give, for comparison, results with the 
Metropolis TMC algorithm (191 values of rh, with 10^ MCS on each). 



6.2 

The tethered Swendsen-Wang algorithm 

The Fortuin-Kasteleyn construction is just as easy to do in the tethered formalism. 
Remember the tethered weight 



coN{m, {s^}) oc e-^"+^-^(m - m)^^-^^/^0{m - m). (g^ 



Notice that the spin-connectivity term, given by the energy LT, is exactly the same 
as that of the canonical weight o^canonicai = exp[— /3Lr], and can thus be represented 
as in ( 6.4[ >. Therefore, the conditioned probability of the bonds given the spins is 



exactly the same in the tethered and the canonical ensembles. Only the conditioned 
probability of the spins given the bond varies, 

{a) Given the {sx\, the bonds are independent from one another and Uxy = 1 
with probability p5sx,sy arid rixy = otherwise. 

{h') Given the {rixy}, all the spins within a cluster are aligned and two spins in 
different clusters are independent from each other. Each of the cluster 
configurations {Sf} has probability 

p({Si}) cxe^-^(m-m)^^-^^^^0{m-m), M = Nm = Y^niSi. (6.13) 



Just as in the canonical case, the tethered Swendsen-Wang lattice update has two 
steps. First we trace the clusters using the conditioned probability (a). Then we 
flip them using {b'). This second step was trivial in the canonical ensemble, as all 
cluster configurations were equiprobable. In the tethered ensemble, however. 



90 



Optimising Tethered Monte Carlo: cluster methods 



we have to pick one using (6.13). A naive way to make this choice would be to 



select one configuration randomly and accept it with a heat-bath probability, 

E*i P({S,}) 

The problem with this method is obvious: A/c is potentially a very large number, 
so it is impossible to compute all the weights and average them. 

Clearly, we need a more manageable scheme. In order to construct it let us first 
recall the Wolff or single-cluster algorithm |wol89| . With this (canonical) method, 
we randomly pick one spin and immediately trace and flip the cluster to which it 
belongs. Since the clusters are flipped with probability proportional to their size, 
this method is able to cause a very large change in the system. 

In the tethered formalism we cannot take the same approach, because the 
weights of the configurations with ±S/ are different. We can, however, make use 
of the notion that not all clusters need be flipped. Indeed, we can select A/"^ ^ A/c 
clusters and consider a heat bath of the 2^c configurations {S[} where all the other 
clusters are fixed. In principle, one could select one of the {S^} randomly and 
accept it with probability 

E^i p({s;}) 

This is alre ady a manageable computation, but there still remains a problem. Since 
the weight (6.13 1 is heavily dependent on M, only a few of the configurations 



will have non-negligible weight. Therefore, a method where we choose one ran- 



domly and check whether it is accepted with (6.15 1 is going to have a very low 
acceptance. It is better to use a modified representation of the heat bath algorithm, 
where we construct a vector with the cumulative probabilities 



E^iP({s;}) 

LfApiis]})' 



Ak = Ao = 0. (6.16) 



We then extract a uniform random number < 7^ < 1 and choose the configuration 
{S^} such that 

< 7e < Afc. (6.17) 

We still have to decide how to choose the Mq clusters that we will attempt to 
flip. Two simple choices, both satisfying the balance condition, present themselves: 



Select the Mq largest clusters. 

Select Mq spins randomly and note their clusters (in case of repetition, we 
keep drawing spins until we get Nq different clusters). 
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Figure 6.1: Spin overlap, Eq. (6.18 1, in a D = 2 critical L = 512 Ising model. We plot the 
results, whose errors cannot be discerned at this scale, for the central minimum of the p(m) 
and for one of its maxima. 



It may seem that the first choice is preferable, with more potential for changing the 
system quickly. However, nothing says that we can only select the Mq clusters and 
flip them once, before retracing the clusters. In fact, tracing the clusters is a rela- 
tively costly operation, 0(NlogN), so the flipping time is negligible. Therefore, 
we can adopt the following algorithm as our lattice update 

1. Trace the clusters in the system. 

2. Flip the clusters. For t = 1,. . . , A/'rep do 

(a) Select Mq clusters with the second of the methods described above (we 
use Mq = 5, but this number can be changed). 



(b) Obtain the {S^^} with the heat bath of Eq. \6.i6\ , fixing the A^c - 
remaining clusters in the orientations they had at f — 1. 

We are therefore using a dynamic Monte Carlo in order to pick the cluster orienta- 
tions, rather than a static Monte Carlo, as is the case for the canonical algorithm. It 
may seem counterintuitive that the optimal value of A/'rep is rather large. In order 
to see it, let us consider the overlap 



(6.18) 



N(l-(m)5,) 

Notice that would vanish for completely uncorrelated configurations. We have 



plotted this quantity in Figure 6.1 Clearly, the configuration can significantly 
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evolve for a fixed distribution of the bonds. We have empirically found that an 
A/'rep that equilibrates the cluster tracing and cluster-flipping times is close to opti- 
mal and convenient (one simply scales A/'rep with N). For the largest systems that 
we will consider here, with N = 256^ spins, this results in A/'rep ~ 5 x 10^. 

In the above discussion we have constructed a Monte Carlo algorithm capable 
of quickly and non-locally changing the spin configuration, potentially accelerating 
thermalisation. Remember, however, that in a canonical setting this was not the end 
of the story: we could obtain significant error reductions by employing improved 
estimators. In our case, even if we cannot write closed formulas such as (6.8-6.12) 
we can get an even better enhancement by taking measurements of the relevant 
observables at each of the A/'rep steps. In Table 6.2 we can see that for the all- 
important tethered field b this nets us a factor 29 in error reduction for an N = 256 
system, where we use A/'rep = ~ 5.2 x 10^, an effect equivalent to considering a 
simulation 841 (= 29^) times longer. 



A/" rep 


(&^^S)m=0.5 X 10^ 


(&^'^P)*=0.5 X 10" 


Error ratio 


20 


-1.9224 ±5.2934 


-1.9224 ±5.2934 


1.00 


2i 


-1.7207 ±3.5529 


-2.1219 ±3.5332 


1.01 


22 


3.2730 ± 2.7653 


3.5203 ± 2.7173 


1.02 


23 


-7.5114 ±1.9301 


-8.1937 ±1.9104 


1.01 


24 


-1.8460 ±1.6477 


-1.8712 ± 1.5495 


1.06 


2^ 


-2.2291 ± 1.3728 


-0.4083 ± 1.1968 


1.15 


26 


-0.5641 ± 1.3838 


0.6485 ± 0.9193 


1.50 


2^ 


-0.4048 ± 1.3637 


-0.4720 ± 0.6707 


2.03 


28 


2.2643 ± 1.3579 


-0.0167 ± 0.4785 


2.84 


29 


0.6270 ± 1.2368 


0.2684 ± 0.3510 


3.52 


210 


2.3544 ±1.1953 


0.3511 ±0.2633 


4.54 


2II 


-1.0922 ±1.3222 


0.2449 ± 0.1813 


7.29 


212 


0.4522 ± 1.3126 


0.2188 ± 0.1359 


9.66 


213 


-0.7033 ± 1.3030 


-0.0485 ± 0.1227 


10.62 


214 


1.0856 ± 1.2832 


0.0181 ± 0.0849 


15.11 


215 


1.5875 ± 1.3750 


0.1504 ± 0.0864 


15.91 


216 


-0.2056 ± 1.2189 


-0.1113 ±0.0671 


18.17 


217 


1.9023 ± 1.4305 


0.0963 ± 0.0593 


24.12 


218 


0.3395 ± 1.3451 


0.0064 ± 0.0607 


22.16 


219 


-2.5036 ± 1.3495 


0.0166 ± 0.0460 


29.34 



Table 6.2: Evolution of (&)m=o.5 and its error with increasing A/'rep for simulations of an 
N = 256^ system at /3c with Amc = 50 000 MCS. We compare the result of measuring b 
only after each MCS with the result of measuring it after each of the A/'rep x Amc cluster 
flippings. The error in the first estimator is quickly saturated but the second keeps getting 
more precise even for very large A/'rep. The last column gives the ratio of the errors of both 
estimates. Since we are working at the probability minimum, we have (&)m=o.5 = 0, so the 
central value can be taken as a check that the errors are correctly estimated. 
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6.3 

Numerical performance analysis 

In order to assess the efficiency of the tethered Swendsen-Wang method, we have 
computed the integrated autocorrelation times for several observables. As in the 
Metropolis case (Section 5^, they are largest for the m = 0.5 (m ~ 0) region. 



However, unlike the local case, now the energy is the slowest observable (this is 
a common feature of cluster methods). Therefore, in order to evaluate the overall 
performance of the method, we concentrate on the energy at m = 0.5. In the next 
section we give a second measurement of the performance, in terms of the precision 
of the computed physical observables. 



We show Tint,E at m = 0.5 for the D = 2,3 Ising model in Tables 6.3 and 6.4 
For D = 2 we have worked at j6 = j6c = log(l + \/2)/2 and for D = 3 at /S = 
0.22165459 ^ |bl699| . 



We consider both the cluster algorithm described in the previous section and a 
mixed scheme, where we take two Metropolis steps between cluster updates (the 
cluster update takes more time to perform, so this does not change the running 
time noticeably). We have computed the integrated time with the self-consistent 
window method described in Section IA.2.1I 

We also compare our results with the corresponding autocorrelation times for 
the canonical Swendsen-Wang algorithm, taken from [iSALOoil for the D = 2 cas^ 
and from posso4B| for D = 3. Recall that the Ising model at the critical point is the 
ideal setting for these methods. 

Notice that in both dimensions the integrated times of the tethered pure cluster 
method are smaller than those of its canonical counterpart, although the dynamic 
critical exponents are comparable. The mixed algorithm has even smaller corre- 
lation times, but in D = 2 it does not follow a power law in L and in D = 3 the 
resulting ze is larger than the one for the pure cluster method. This probably means 
that the asymptotic regime has not yet been reached for the mixed algorithm. Pre- 
sumably, for large enough systems it will scale as the pure cluster one. However, 
since for our range of lattice sizes the combined algorithm has smaller t, it is the 
one we shall use to compute physical results in the next section. 

In short, the critical slowing down of the Tethered Monte Carlo method, already 
absent for magnetic observables with a local update scheme, can be removed just as 
completely as in the canonical case with the application of cluster methods. This is 
a demonstration that adopting the tethered formalism does not imply abandoning 
optimised update schemes, nor does it hinder their performance. 



3The quoted values are reported by Salas and Sokal only in the preprint version of IIsalooL as a 
combination of their simulations and those of [sAigij . 
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L 


TMC (Cluster) 


TMC (Met. + Cluster) 


Canonical 


16 


2.310(14) 


0.775(3) 


3.253(8) 


32 


2.758(20) 


1.055(5) 


4.011(11) 


64 


3.347(22) 


1.417(7) 


4.891(11) 


128 


4.11(5) 


1.861(12) 


5.510(20) 


256 


4.87(4) 


2.391(16) 


6.928(22) 


512 


5.79(8) 


3.040(24) 


8.107(25) 


1024 


6.78(8) 


3.70(4) 




Ze 

X^/d.o.i. 


0.241(7) 
0.36/2 




0.222(7) 



Table 6.3: Integrated autocorrelation times for the energy at m = 0.5 and /3 = /3c for the 
D = 2 Ising model. We compare the cluster and mixed versions of our TMC algorithm. 
We also include the results of [ saloo] for canonical Swendsen-Wang. For the pure cluster 
algorithm we fit to Te = AL^^, in the range L > 128. Our resulting value for the dynamic 
critical exponent is very similar to the result of IIsalooI for the canonical algorithm. 



L 


TMC (Cluster) 


TMC (Met. + Cluster) 


Swendsen-Wang 


16 


2.135(13) 


0.782(3) 


5.459(3) 


32 


2.80(3) 


1.134(5) 


7.963(9) 


48 


3.467(28) 


1.427(8) 


9.831(9) 


64 


3.88(3) 


1.700(10) 


11.337(12) 


96 


4.79(5) 


2.152(14) 


13.90(3) 


128 


5.46(6) 


2.566(17) 


15.90(5) 


192 


6.54(11) 


3.32(4) 


19.10(9) 


256 


7.51(13) 


3.85(5) 


21.83(10) 


Ze 


0.472(8) 


0.591(4) 


0.460(5) 


X^/d.o.f. 


5.85/5 


4.61/5 





Table 6.4: Integrated autocorrelation times for the energy at m = 0.5 and /5 = /Sg for the 
D = 3 Ising model. We compare the cluster and mixed versions of our TMC algorithm 
with the results for the canonical version reported in josso4Bj . Our value of Ze comes from 
a fit for L > 32. 
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6.4 

Physical results for the ferromagnetic Ising model 

In this section we compute some physically relevant quantities for the critical Ising 
model in D = 2 and D = 3. In the former case, we compare the the tethered 
cluster simulations with our Metropolis results of Chapter [5] and in the latter with 
the canonical Swendsen-Wang results of [Iosso4b|. 



We give our D = 2 results in Table 6.5 We have used the same grid of 77 
simulation points as in the Metropolis simulations of Chapter ^ but now we have 
performed 10^ cluster updates instead of 10^ Metropolis ones. As we can see, the 
errors of the Swendsen-Wang algorithm are about 5 times smaller for L = 512 and 
10 times smaller for L = 1024. 

For D = 3 we have simulated an N = 128'^ lattice at /3c, taking 10^ cluster steps 
on each of the points in a grid of 50 values of m. We compare with the canonical 
Swendsen-Wang simulation conducted in josso4B| , with a total of 4.8 x 10^ cluster 
steps. This results in a similar number of total lattice updates (and, therefore, 
simulation time) for both simulations. In accordance with our autocorrelation time 
study, the statistical errors in the canonical averages computed with both methods 



are similar (see Table 6.6 on page 97 



Let us recall, however, that, while the results for zero applied field are similarly 
precise in the canonical and tethered computations, the latter has more information 
about the system. This is because its results can be reweighted to yield canonical 
averages for non-zero applied field. 

As a final test of our method's accuracy and as a demonstration that it can do 
more than merely reproduce canonical averages we shall reproduce the study of 



Section 5.2.2 We shall compute the anomalous dimension of the D = 3 Ising model 



from a FSS analysis of the peak position in p{m] 



L 


-(«) 


X2/L^ 




C 


B 


512 (MTMC) 


1.41542(4) 


0.2293(7) 


0.903(6) 


16.57(3) 


1.168(2) 


512 (CTMC) 


1.415435(7) 


0.229 71(13) 


0.9064(11) 


16.588(10) 


1.1675(4) 


512 (E) 


1.415429... 






16.595404... 




1024 (MTMC) 


1.41489(4) 


0.1949(15) 


0.919(15) 


18.28(8) 


1.163(6) 


1024 (CTMC) 


1.414819(4) 


0.19302(15) 


0.9053(14) 


18.332(16) 


1.1677(6) 


1024 (E) 


1.414821... 






18.361 348. . . 





Table 6.5: Results at the critical temperature for the D = 2 Ising model. We compare the 
result of taking 10^ MCS with a Metropolis implementation of TMC (MTMC) with that of 
taking 10^ MCS with the tethered Swendsen-Wang scheme (CTMC). For the energy and 
specific heat we also give the exact results of |fer69| . 
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Recalling the analysis of Section 5.2.2 and the scaling relation 2/3 /v = D — 2 + f], 



Eq. (2.19), we have 



m 



peak 



1 

2 



(6.19) 



Our results for the D = 3 Ising model can be seen in Table 6.7 For these simulations 



we have first located the approximate position of the peak with a short sweep in m 
and then run long simulations for only two very close m values, one at either side 
of the peak|4] The peak position is then computed from the saddle-point equation 



(6.20) 



with a simple linear interpolation. 

We show the result of fitting the data in Table 6.7 to (6.19 1 in Table 6.8 We 
obtain good fits already with Lmin = 48 and excluding more points results in 
compatible values of rj, with growing errors. Our preferred final estimate is ;y = 
0.0360(7), combining the central value for Lmin = 48 and the more conserva- 
tive error for Lj^in = 64 to account for systematic effects. This value should 
be compared to the best determinations known to us, a Monte Carlo computa- 
tion giving t] = 0.03627(10) [iHASioi and a high-temperature expansion value of 
f] = 0.03639(15) |cAM02 j | . Both quoted values, however, were computed with a 
'perfect' action, not directly in the Ising model. 



^Actually, for each lattice size we ran many independent runs, whose results were later averaged. 
The table gives the total of MCS, adding those of all the individual runs. 
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Method 


MCS 


-(") 


C 


X 


^2 


Canonical SW 
CTMC 


48 X 10^ 
50 X 10^ 


0.992946 6(48) 
0.992949 3(45) 


66.465(54) 
66.522(40) 


21 193(13) 
21202(13) 


82.20(3) 
82.20(6) 



Table 6.6: Comparison of canonical Swendsen-Wang (data from |osso4b| ) with TMC for an 
N = 128"^ lattice at j6c. We take 10^ MC steps at each of the 50 points of our m grid. This 
results in a similar number of MCS for both simulations. 



L 


MCS 




1 

^peak 2 


16 


1.0 X 10^ 




0.33421(5) 


32 


1.0 X 10^ 




0.233 77(4) 


48 


1.0 X 10^ 




0.18956(4) 


64 


1.0 X 10^ 




0.16341(4) 


96 


1.0 X 10^ 




0.13240(4) 


128 


1.0 X 10^ 




0.114083(24) 


192 


6.0 X 10^ 




0.09246(4) 


256 


8.2 X 10^ 




0.07959(12) 


Table 6.7: Position of the peak of p{m; L) for the D = 3 Ising model at f^c- 




V 




;tVd.o.f. 


16 


0.03392(21) 




42.6/6 


32 


0.03526(30) 




8.79/5 


48 


0.036 0(5) 




4.24/4 


64 


0.0368(7) 




1.02/3 


96 


0.0363(12) 




0.78/2 


128 


0.0373(19) 




0.31/1 


192 








256 








Table 


6.8: Fits to ffipeak - 2 = AL-(°-2+,7)/2 ^^xe data in Table 


6.7 


for several ranges 



Part III 

The Diluted Antiferromagnet in a 

Field 
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CHAPTER VII 



The state of the art: the DAFF with canonical 
methods 



In this Part we intend to demonstrate the first of our two main points: when study- 
ing complex systems, one should use a statistical ensemble tailored to the problem 
at hand. We do this in the context of the diluted antiferromagnet in a field (DAFF), 
a system that had been extensively studied with canonical methods, but that re- 
mained very poorly understood. In the present Chapter we start by giving an 
introduction to the model (and its universality class), noting the difficulties that 
have been encountered in the different approaches. We then, in Section 7.2 present 
the main observables and summarise the theoretical expectations for the critical be- 
haviour. In Section 7.3 we demonstrate the difficulties of a canonical MC approach 
to the problem. The next Chapter, finally, presents our study of the DAFF with the 
tethered formalism. 



7-1 

Introduction 

In Chapter [2] we introduced the concept of quenched disorder and explained how 
it can radically affect the collective behaviour of a physical system, to the point that 
disordered models are prime examples of complex systems. Here we consider a 
particularly interesting class: magnetic systems with random fields. 

In particular, we are interested in the random field Ising model (RFIM): 

^ = - E s^Sy - Y^Ksjc = LT + Ep. (7.1) 
The hx are quenched and independent random variables, typically chosen so that 
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Figure 7.1: Inverted compact domain in a ferromagnetic system. 



hx = 0. Let us in addition assume they are Gaussianly distributed and 



hxhy = SxyH. 



(7-2) 



Recall that the overline denotes the average over the disorder. In the language of 
Chapter [2I then, a sample is a choice of {hx}. 

Notice that the pure system is in this case the ferromagnetic Ising model studied 
in Part |ll| whose critical behaviour is well understood. In particular, the phase 
diagram for D > 1 consists in a low-temperature phase with ferromagnetic order 
and a high-temperature paramagnetic phase, connected through a second-order 
phase transition. In principle, the presence of a small random field would disturb 
the ferromagnetic order and lower the transition temperature. For a high enough 
value of the field strength H the critical temperature would go to zero and the 
ferromagnetic phase would be destroyed. 

That this naive qualitative picture of the phase diagram hides a more complex 
and interesting situation was first demonstrated by Imry and Ma in 1975 jiMR73] . 
In particular, they showed how, for a low spatial dimension, even an infinitesimal 
H can destroy the ferromagnetic order completely. Since this argument is very 
simple and elegant, it is worth recalling here. 

Let us consider a system with a predominantly ferromagnetic order, with most 
of the spins in the ( — ) orientation. In this situation, we flip a compact domain of 



radius R (see Figure 7.1). 

The value of the exchange interaction energy in the bulk of this domain does 
not change with this reversal, but all the links across its boundary are inverted. 
Therefore, there is an increase ALT ^ R^~^. As to the field-interaction energy the 
average value of hx inside the domain is of course zero. On the other hand, the 
variance of the sum of n Gaussian variables, each of variance H, is y/nH. That is, 
we have AEp = 2Ep ^ HR^^^. Choosing a region with positive Ep, we then have 



AE = ALT + AEp = aR 



D-l 



hHR 



D/l 



(7-3) 
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where a,b are positive constants. Therefore, ifD — 1 < D/2 we see that these 
perturbations are actually energetically favourable, for large enough R. Hence, 
the ferromagnetic phase is unstable for D < 2. The marginal case D = 2 was 
considered in |bin83[ and also found to be unstable. Therefore, the lower critical 
dimension for the RFIM had to be Di > 2 (other mechanisms than the domain-wall 
argument could destroy the long-range order in three or more dimensions). The 
issue was solved by the rigorous proof that Dj = 2 |imb84[ [bri87[ | AIZ89 1 . 

The disorder in the RFIM was thus showed to be strong enough to modify 
the phase diagram of the pure model profoundly, yet not so radical as to make 
the phase transition disappear altogether. The interest in the system was further 
enhanced when it was shown that it could be faithfully realised physically by a 
diluted antiferromagnet in a uniform field (DAFF) jFisygf [car84 [. This ignited a 
considerable experimental effort (see |bel98| for a review). 

In short, it was shown more than twenty years ago that the RFIM experiences 
a phase transition in three spatial dimensions and that the system is also of exper- 
imental interest through the DAFF. Despite a continuing analytical, experimental 
and numerical effort, however, almost all the details of this transition remain con- 
troversial. 

In particular, characterising the phase transition proved to be no easy task from 
an analytical point of view (see [ nat98[ |domo6II for reviews of theoretical results). 
First of all, we have the general observation that the upper critical dimension for 
disordered systems is Du = 6 (as opposed to Du = 4 for pure models, cf. Chapter [2] 
and Section 7.2.1 below). Therefore, it is more difficult to predict the behaviour in 
the physically interesting case of D = 3 from renormalisation-group expansions in 
e = Du - D. 

An example of the problems with perturbation expansions is afforded by the 
issue of dimensional reduction. In 1979, Parisi and Sourlas |jPAR79B| presented an 
elegant supersymmetry argument by which the critical behaviour of the RFIM at 
dimension D should be equivalent to that of the pure model in D — 2. But this had 
to be wrong: we have already seen that there is a phase transition in D = 3, while 
the Ising model has no ordered state in D = 1. Parisi pointed out a possible flaw in 
the supersymmetric argument as soon as 1984 |par84 |, but the issue has remained 
an active topic of research (see, e.g., liBREOiilxisiil and references therein). 

The experimental and numerical approaches to the problem are similarly pla- 
gued by severe intrinsic problems. The main one is perhaps posed by the peculiar 
critical behaviour of the DAFF /RFIM ]vil83[ liis86Al [nat98[ . In general, most au- 
thors work in the framework of an unconventional scaling theory, which we shall 
explain in Section 7.2. i[ where the spatial dimension D is replaced by D — in 
the traditional hyperscaling relation ( 2.i9d[ >. Here ^ is a new critical exponent, be- 
lieved to be ~ 1.5 from theoretical arguments (cf. Section 7.2.1 ) but inaccessible to 
a direct computation both in experimental studies and in (canonical) Monte Carlo 
simulations. 

The complete uncertainty in the determination of 9 has allowed experimental 
and numerical works to report qualitatively different sets of values for the other 
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critical exponentsj^ One example of this is the experimental claim for a divergence 
in the specific heat | |bel83[ |bel98 \, not observed numerically liHAROij . We must 



note that the specific heat exponent is notoriously difficult to estimate numeri- 
cally I|malo6L especially considering that the expected divergence is very slow. 

From the experimental point of view, an additional problem is posed by the 
uncertainty over the analytical form of the scattering line shape or structure factor. 
Different ansatze lead to mutually incompatible results for the thermal critical ex- 
ponent, from V = 0.87(7) jsLAgg] to v = 1.20(5) |jYE04|. The first of these works 
also reported a value of // = 0.16(6) which, combined with a divergent specific heat 
{oi > 0), would violate hyperscaling if ~ 1.5. 

On the numerical front, the determination of v has been at least as difficult, 
with estimates ranging from v !=s 1 to v > 2 (see, e.g., | |nat98[ Iwuo6[ iviNioi for 
detailed lists). The most precise values have typically come from ground state 
studies. Hartmann and Nowak claimed v = 1.14(10) in jHARgg] , although the 
value has recently been increased, with a recent estimate of 1.37(9) [mido2] (other 
recent works |MALo6l lFYTiii give similar results). In general, these latest numerical 
estimates are outside the experimental range of values and only barely compatible 
with a divergent specific heat. Another recent work, ||wuo6ll , gives v = 1.25(2) and 
Oi = —0.05(2). However, these values suffer from similar problems as experimental 
estimates (with which they are compatible) in that they do not satisfy hyperscaling 
bounds. 

An additional source of uncertainty is the extremely low value of the magnetic 
critical exponent, believed to be jS ^ 0.01. This exponent is also determined with 
almost no precision | nat98[ [har99|, the latest estimate of which we have notice 



being /3 = 0.007(5) |FYTii j | . The fact that /3 is compatible with the first-order result 
of /3 = has led some authors to suggest the possibility that the phase transition 
in the DAFF may not be continuous. This claim is typically based on the finding 
of metastable signatures |sou99} |MAi07f |wuo6| | and casts doubts on the supposed 
universality between the DAFF and the RFIM. 

A final issue is the lack of self-averaging in the DAFF, which has been exten- 
sively studied |paro21 Imalo61 iFYXiij and complicates any numerical analysis. 

At the root of these problems is the deep physical issue of free-energy barriers, 
discussed in Chapter [3} Both experimentally and in canonical MC simulations, the 
system gets trapped in local minima, with escape times log t ^ which makes it 
exceedingly hard to thermalise even relatively small systems and ensures that the 
statistics will be dominated by extremely rare events. 

In the remainder of this chapter, after giving some definitions in Section 7.2 we 
shall examine more closely the reasons why canonical MC simulations of the DAFF 
fail. 



^Notice that only three exponents are independent, the other being fixed through scaling and 
hyperscaling relations. 
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Model and observables 

We consider the diluted antiferromagnet in a field (DAFF), defined by the following 
Hamiltonian (as always N = Ising spins in a cubic lattice) 

^ = E ^xSxeySy - ^ E ^^^^ - E ^^cS^TTx, TT^^ = e'''^f'=i (7.4) 

In addition to the Ising spins {Sx = ±1), this Hamiltonian includes quenched oc- 
cupation variables ex- These are equal to one with probability p and equal to zero 
with probability (1 — p) and characterise the dilution of the system. The value of 
p is not supposed to be important to the physics, so long as we keep our distance 
both from the percolation threshold of pc ~ 0.31 |sta84[ and from the pure casej^ 



The Cx are quenched variables, which, recalling Section |2.2[ means that we should 
perform first the thermal average for each choice of the and only afterwards 
compute the disorder average. In the canonical ensemble we denote the thermal 
average by ( • • • ) and the disorder average by ( • • • ) . 



The Hamiltonian (7.4 1 includes a two-component applied field {h,hs), coupled 



to the regular and staggered magnetisations 

M = Nm = Y,exSx, (7-5) 

X 

Ms = Nnis = Y^exSxTTx, ■ (7-6) 

X 

We shall continue to denote by U the spin interaction component of the energy 

U = Nu= ^ exSxCySy. {7.7) 

{x,y} 

Therefore, the total energy £ of a given spin configuration can be written as 

E = U-hM- hsMs. (7.8) 

In general, one is interested in the case hs = 0, but we will find this parameter 
useful later on. 

In order to study spatial correlations we need to consider the staggered Fourier 
transform of the spin field 

cp{k) = i:Sxnxe-''-\ (7.9) 



^Some experimental works IIbarooI have pointed out possible non-equilibrium effects if the con- 
centration is low enough to permit percolation of vacancies (p < 1 — pc ~ 0.69), although the issue 
is not completely understood [YE02t[SHE04j . 
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7.2.1 Phase transition in the DAFF: theoretical expectations 

In this section, we give some heuristic arguments justifying the modified scaling 
relations expected for the DAFF, assuming a second-order scenario|3] 

Let us consider the system at the paramagnetic-antiferromagnetic transition 
point Tc{p,h) for a given dilution p and field h. Now let us consider the effect 
of introducing a staggered magnetic field hs- Unlike h, h^ is coupled to the order 



parameter of the transition, so by definition (2. 18 1 of the critical exponent 5, 



Ms - hl^^. 



Now, recalling that mg ^ xi , from (2.13 1 and (2.16 1 we have 

h, ^ r^'/^ 



(7.10) 



(7.11) 



In addition, if we divide the system in the even and odd sublattices, the dilution 
implies that one is going to have ^ ]_PI'^ more spins than the other. This causes 
an excess staggered field bh^ ^ Now, if we want the system to have a real 

transition as we approach the critical point hs — > 0, we need 



> 0. 



(7.12) 



That is, we need 



D 

2" 



> 



a. 



2v 



(7-13) 



where we have used the scaling relation (2.19b I. 

If we plug the mean-field exponents into this equation, we obtain D/2 > 3. 
Therefore, the upper critical dimension is in this case Du = 6. 



Below Du we can in principle apply the hyperscaling law (2.i9dl. However, if 
we do this we obtain 



vD 
vD 



> 



2- a 
2 + 7 



7 < 0. 



(7-14) 



Since the susceptibility critical exponent 7 must be positive, this is not possible. 

The answer to this problem was given independently by Villain |vil83[ and by 
Fisher | |fis86a ||. We consider the system at the length scale of the correlation length. 
According to Widom scaling (see, e.g., jAMi03i l), the free energy of this correlation 
volume is 



f (^) - ^ 



D-{2-a)/v 



(7-15) 



In conventional systems, the scale of variations of f (^) is set by the thermal fluctua- 
tions, so F(^) is of order one and the usual hyperscaling law vD = 2 — ol follows. In 



3 We follow closely the arguments given in ]nat98| and IIfis86aI for the RFIM, but adapt the 
language to the antiferromagnetic system. 
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our disordered system, however, the fluctuations due to the randomness dominate 
and 

m - (7-i6) 



where is a new, independent, critical exponentj^] Comparing ^7.13 } with (7.16) we 
read off the modified hyperscaling relation 

2-ci = v{D-9). (7.17) 

In addition, from i y.ij^ we obtain the inequality 

9>^/v = 2-r]. (7.18) 



This last relation has been proven rigorously for a class of models ]sch83[ 



Some authors |aha76[ Isch86[ have suggested that the above inequality is satu- 



rated for the DAFF, giving 

e = 2-rj, (7.19) 

so we would again have only two independent critical exponents. 



Several heuristic arguments have been given to motivate Eq. (7.19) for the RFIM 
(see, e.g., INAT9 8I), but we can also examine the issue directly in the DAFF. 

Let us consider the system in the low-temperature phase with no field, so there 
is a non-zero spontaneous staggered magnetisation mg (in one of two orientations). 
As we said before, one of the sublattices (let us say the even one), has ^ L^^^ 
more spins than the other. When the magnetic field h is introduced, of the two 
orientations for the same |ms| the one where the even spins are aligned with the 
field will be more favourable. 

Now, let us consider a region of radius R where the odd sublattice is dominant 
(always possible to find, if the system is large enough). Clearly, inverting the mag- 
netisation within this region aligns more spins with the field, lowering the total 
energy. The energy gain is ^ —R^^^m^h. 

On the other hand, this inversion also has an energy cost, due to the surface en- 
ergy. Therefore, the probability of inverting this region is going to be proportional 
to the exponential of 

F{R) = R^/^ffis - ER^~\ (7.20) 

where E is the surface tension and we have neglected irrelevant constant factors 
in both terms. By definition of correlation length, this probability is maximum for 
^ = R, where F'{R = ^) =0. Therefore, 

F(0 - ^^/'m, ^ E^^-\ (7.21) 



Finally, by definition (7.16 1 of 6 and using nis ^ ^ 

e ^ l^l^m, ^ (7.22) 



^From the naive Imry-Ma argument of Section 



7-1 



we would expect 9 = D/2. 



io8 



The state of the art: the DAFF with canonical methods 



and we have 



e = D/2-^/v, 



(7-23) 



which is equivalent to (7.19 1. 

On the other hand, from f(^) ^ and (7.21 1, the surface tension is seen to 
scale as L ^ c^i^ice we are assuming a second-order scenario, this has to 

vanish in the thermodynamical limit and we have 



e<D-i, 



(7-24) 



(this last bound can be obtained without assuming the two-exponent scenario). 

Finally, let us close this section by mentioning that the need for a third exponent 
is related to the existence of two correlation functions that scale differently. 



1 



(7.25a) 
(7.25b) 



In this equation, (p{k) is the staggered Fourier transform of the spin field, defined 
in { 7^ , and f] is 

f] = 2 + r]-9. (7.26) 

Notice that in the two-exponent scenario, fj = 2r] and the disconnected propagator 
diverges as the square of the connected one 



(7-27) 



7-3 

The DAFF in canonical Monte Carlo simulations 

In this section we carry out canonical Monte Carlo simulations of a DAFF system, 
to demonstrate the problems inherent in the traditional approach and motivate our 
tethered study of Chapter |8| 

In these simulations we have used parallel tempering, a common method to 
thermalise systems with free-energy barriers. The basic idea is running several 
copies of the same sample of the system concurrently, each at a different tempera- 
ture. Every so often, we attempt to exchange copies at neighbouring temperatures. 
When the temperature is raised, the energy fluctuations increase and copies that 
would get trapped in metastable states can escape (see Appendix A for a full ex- 
planation of this method). 

In our case, we have not attempted to conduct a full canonical study of the phase 
transition, we have just simulated a few samples. For each of them we performed 
many parallel tempering runs, using 40 temperatures evenly spaced in the range 
1.6 < T < 2.575 and choosing the field such that fih = 1.5 (in this way, we intend 
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Figure 7.2: Time evolution of the interaction energy u{t) — Eq. \y.y\ — for the configuration 
at T = 1.6, j6 = 2.4 in 50 different parallel tempering simulations of the same sample. We 
plot 50 out of a total of 100 runs. After a variable simulation time, most of the simulations 
fall into one of two metastable states, whose energies Ui and M2 we mark with straight 
black lines. In a thermalised situation, we would see timnelling between the two states (or 
all of the runs reaching the same one, if only one is dominant). 



to cross the phase diagram with a diagonal straight line). We considered a system 
size of L = 24 and a dilution of p = 0.7. These parameters are taken from jMAio/j j^j 
Figure 7.2 shows the time evolution of the interaction energy u{t) for the lowest 
temperature (T = 1.6, h = 2.4) for 50 runs of a single sample (we performed 
a total of 100 such runs, but we only show 50 to avoid cluttering the graph). 
We can see how, after a variable simulation length, most of the systems reach 
one of two metastable states, with interaction energies m*^^^ = — 1.07 7 35(8) and 
u^"^^ = —1.0973(7). The corresponding total energy densities, Eq. (7.81, are e^^^ = 
-1.376 08(2) and e^^) = -1.3820(2). The same qualitative picture is obtained for 
different samples. 

The first conclusion we can draw from this plot is that parallel tempering has 
failed. Assuming ui and U2 represent intermediate metastable states, not represen- 
tative of the system's equilibrium phase, we have failed to escape from them and 
reach the relevant regions of configuration space. On the other hand, assuming 
both are important to the equilibrium phase, then in a thermalised simulation we 
should see tunnelling between the two. Notice that obtaining the two estates in 
separate runs is not enough, we need to see the tunnelling in order to know their 
relative weights. However, we have not seen a single such jump, once the simula- 
tion reaches one of the two states it never escapes. In total we performed 100 runs 



51n fact, the authors of |maio7| go to even lower temperatures. Here we make things easier for 
the parallel tempering by going no further down than T = 1.6, which the authors of [ iMAi07j foimd 
to be the critical temperature along the line fih = 1.5. 
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Figure 7.3: Scatter plot of the interaction energy u and the staggered magnetisation for the 



100 runs of Figure 7.2 In each case we average the observables over the last 10 000 MCS, 
corresponding to the last 0.1% of their whole Monte Carlo history. We see that the states 
with energies Ui ^ —1.077 and U2 ~ —1.097 correspond to systems in an antiferromagnetic 
phase (with opposite orientations). 



for this sample, each with 10'' parallel tempering updates, which suggests that the 
tunnelling probability is upper bounded by 10~^. In other words, parallel temper- 
ing is not able to thermalise the system in a reasonable amount of time. Notice that 
some of the simulations have not even reached one of the two metastable states 

The second problem with the canonical approach is that of interpreting the 
results. Suppose we had considered runs many orders of magnitude longer than 



the ones plotted in Figure 7.2 Then, we would eventually begin to see the quick 
jumps from one state to the other, separated by long stays in each of them. This is 
the sort of metastable behaviour that one expects in the neighbourhood of a first- 
order phase transition, where two different phases have a similar weight, but are 
separated by large tunnelling barriers. 



However, this interpretation would be wrong, as evinced by Figure 7.3 In it we 



represent a scatter plot of the staggered magnetisation against u for the 100 runs of 



Figure 7.2 It is readily apparent that the two states correspond to systems with op- 

posite sign of the order parameter ml ' = 0.5023(3), ml ' = —0.543(3). Therefore, 
the observed metastability would not correspond to jumps between an antiferro- 
magnetic and a paramagnetic phase (the phase transition we want to study), but 
would rather reflect jumps between two antiferromagnetic states with different spin 
orientation^ 



^One could think that these two states should be symmetric and have the same energy, but 
remember that the number of spins in the odd and even sublattices is not the same for this random 
system, thus breaking the usual Z2 symmetry usually associated with antiferromagnets. 
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This latter problem is intrinsic to the canonical description. Even if we could 
devise a more efficient thermalisation algorithm (or use a much faster computer) 
our results would still be contaminated by this spurious metastability and their 
statistical analysis dominated by extremely rare events. 



CHAPTER VIII 



The DAFF in the tethered formalism 



We have just seen how traditional Monte Carlo methods are not well suited for the 
simulation of the DAFF. This was not only a case of thermalisation problems, but 
also revealed intrinsic limitations in the canonical approach, where the statistics 
is dominated by extremely rare events. In this Chapter we intend to demonstrate 
how a tethered approach can solve many of these difficulties. 



8.1 

The DAFF in the tethered ensemble 



Our canonical investigation revealed that in the DAFF one has to deal with free- 
energy barriers separating states with different staggered magnetisations. There- 
fore, the appropriate reaction coordinate to tether is mg. In addition, we are in- 
terested in a transition at non-zero magnetic field, where metastability could also 
appear. Therefore, we also tether the regular magnetisation m. Notice that by doing 
this, we do not need to specify the applied field for the tethered simulations. We 
simply simulate at zero field and then reweight the results as we did for the Ising 
model. 

Let us consider the tethered description of a single sample. Recalling Sec- 
tion 



4.1.1 we need a two-variable effective potential, whose gradient defines the 



tethered field, now a two-dimensional vector. 



VON{m,ms) = ( ^ , ^ 1 = ((&)m,ms/ {h)m,fns) (8.1) 



113 



114 



The DAFF in the tethered formalism 



In this equation 



b 
k 



1/2 -l/N 



m 
1/2- 



- m 
l/N 



mc 



(8.2a) 

(8.2b) 



Then, the canonical averages in the presence of an external magnetic field with 
a regular component h and a staggered component hs are related to the tethered 
expectation values through the Legendre transformation 



{0){h,hs 



Jdmf dms (C'>m,?«s e' 



-N[npj{m,7ns)—tnfih—ms 



J din J drhs e-^[^N{m,ms)-ni^h-msPhs\ ' 

although here we are interested in /zg = 0, so we shall use the notation 

{0){h) = {0){h,hs = ^). 
And furthermore, we shall use the following shorthand 



and 



ci^{m,m^) = nfq{m,m^) - ^hth, 



Vn^^\m,ms) = {{b),,,^-^h, {bs)^,,). 



(8-3) 



(8.4) 



(8-5) 



(8.6) 



According to the procedure described in Section 3.2 a tethered study of the DAFF 
would consist in a set of tethered simulations for fixed values of (m, mg) in a two- 
dimensional grid. From the value of the tethered field {{b)^ , (&s)m ^s) would 
then reconstruct nfq{m,ms) and use (8.3 1. 

This is not, however, the most practical approach. For instance, it would involve 
the non-trivial numerical computation of a two-dimensional potential from its gra- 
dient, evaluated in a discrete grid. Instead, we note that, for a given h, only a very 
small region of the reaction coordinate space {m,m^) will have a relevant weight. 
In fact, the integral ( |8.3[ > is going to be dominated by the minima of the effective 
potential (recall our discussion of ensemble equivalence in Section 4^. These will 
be given by the saddle-point equations 



Vni;Vm,ms) = B = 0, 



(8.7) 



that is 



ran 



N 



dm 
dm. 



s/ m,ms 



0. 



(8.8) 



{h) 



In principle, some of these equations will correspond to local minima of Oj^ 
and some to local maxima or to saddle points in the strict sense (a minimum in 
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Figure 8.1: Result of a tethered investigation of the sample that we studied with canonical 
methods in Section 



7-3 



We join the two metastable states found in the canonical compu- 
tation with a straight line and measure along this line the tethered expectation values for 
the energy u and the projection of the tethered field Bf (top and middle, respectively). We 
simulated 281 independent points along the line, but only plot one in five with error bar 
to avoid cluttering the graph. The tethered field has many zeros, defining saddle points 
in (8.31. Finally, the bottom panel shows the (exponential of the) integral of Bf, defining 
the relative weight of the points along the path. Two peaks dominate, where the tethered 
values of the energy correspond to the two metastable states Ui and 112 that we found in 
Section 7.3 Notice the extremely low probability of the connecting region, which explains 



the difficulty to thermalise canonical simulations. 
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one direction, but a maximum in the other). In the case of local minima, the saddle 
points will correspond to metastable states, potentially relevant to the description 
of the equilibrium phase. Notice that we also obtained several metastable states 



in our canonical study of Section 7.3 but we were unable to know their relative 
weights. In the tethered approach, however, this is easily done by considering their 
potential difference, i.e., the line integral of B along a connecting path. 



Let us demonstrate this procedure for the sample of Section 7.3! where we had 



(h) 

identified two metastable states (two local minima oi , h\ the tethered nomen- 
clature). We first have to determine the values of {m,m^) that correspond to these 



saddle points using (8.8 1. This is easy, because our canonical study has given us 



the values of m^^^ and mg \ We can now use (8.2a), setting fcs = and b = ^h = 2.4. 

Therefore, we know that mg''* ~ m^ ^ + 1/2 and fh^'^ ~ m^'^ + 1/ (2(1 — ^h)). From 
these starting guesses the actual saddle points are readily found. We can connect 
them with any path (because B is a conservative field). Since the system is random, 
it is difficult to predict which will be the optimal connecting curve (in the sense of 
thermalisation), so we consider a simple straight line, 

{m{t),ms{t)) = (m(i),7fzi^^)(l - t) + (m(2),m?)). (8.9) 



The whole computation is depicted in Figure We performed tethered sim 



ulations for 281 values of the parameter t and measured the tethered expectation 
values of the energy (M)^(^f^ ^^(^^^ and the tethered field B. These are plotted in the 

top and middle panels of the figure (for the tethered field we plot its projection Bt 
along the path). 

Now, following ( |8.3[ >, the probability density or relative weight of the points 
along the path is just 

p{m{t),ms{t)) = e-^^NiMt),Mt))^ (8.10) 

where 0^^\t) is the line integral of B, with the integration constant chosen so that 
the whole weight is normalised. This probability density is plotted in the bottom 



panel of Figure 8.1 in both a linear and a logarithmic scale. We find that one of the 
two metastable has about ten times more weight than the other (curiously, fewer 
canonical simulations found the more relevant state). In addition, the two resulting 
peaks in the p{m{t),ms{t)) are separated by a region with very low probability, 
explaining the difficulty of canonical simulations to tunnel between the two states. 
Interestingly enough, even within this low-probability sector we can see a rich 
structure of the p{rn{t),fhs{t)) — equivalently, of the effective potential. In other 
words, we are seeing a quantitative example of a rugged free-energy landscape. 

8.I.I Self -averaging and the disorder average 

In order to perform a quantitative analysis of the DAFF, we have to simulate a large 
number of samples and perform the disorder average. The naive way of doing this 
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for a system with quenched disorder would be to measure B and construct for 



each sample, then use Eq. (8.3 1 to compute all the physically relevant {0){h). Only 
then would we average (O) {h) over the disorder. 

This approach is, however, paved with pitfalls. First of all, computing a two- 
variable Clj^{m,ms) from a two-dimensional {rh,in^) grid is not an easy matter. In 
the previous section we avoided this problem by computing the saddle points first 
and then evaluating D]v only along a path joining them. But this cannot be done 
efficiently and safely for a large number of samples. Even if it could be done, the 
free-energy landscape of each sample is very complicated, with many local minima, 
several of which could be relevant to the problem, so a very high resolution would 
be needed on the simulation grid. 

Finally, even reliably and efficiently computing the canonical averages {0){h) 
would not be the end of our problems. Indeed, as we discussed in Section |2.2.2[ 
the canonical expectation values suffer from severe self-averaging violations. In this 
section we address all these problems and demonstrate the computational strategy 
followed in our study of the DAFF. 

The first step is ascertaining whether the tethered averages themselves are self- 
averaging or not. We already know that, since we are going to be working with a 
large regular external field h, the relevant region for the regular magnetisation m 



(and hence rh) is going to be exceedingly narrow (cf. Section 5.3 where the fields 
were much smaller). Therefore, we are going to explore the whole range of mg for 
a fixed value of m = 0.12 (this smooth magnetisation is in the range where the 
saddle points typically lie for the applied fields we considered in Chapter [7|. 

We have plotted the staggered tethered magnetic field (&s)m=o.i2,ms for 20 sam- 
ples of an L = 24 system at j6 = 0.625 in Figure |8.2| — top. The different curves have 
a variable number of zeros, but all of them have at least three: one in the central 
region and two roughly symmetrical ones for large staggered magnetisation. The 
positions of the two outermost zeros clearly separate two differently behaved re- 
gions. Inside the gap the sample-to-sample fluctuations are chaotic, while outside it 
the sheaf of curves even seems to have an envelope. This impression is confirmed 



in the bottom panel of Figure 8.2 where we show the sample-averaged tethered 
magnetic field for several system sizes. 

In order to quantify this observation we can study the fluctuations of the disorder- 
averaged (&s)^,^3. 



This quantity is plotted in the left panel of Figure |8.3[ As we can see, it goes to zero 
as a power in L, so we also plot fits to 

a^(m = 0.12,ms) = A(ms)L-'^^'\ (8.12) 



This would seem like a very good sign, because it could be indicative of self- 
averaging behaviour. However, it is not the whole story. If we recall the bottom 
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Figure 8.2: Top: Staggered component of the tethered magnetic field, {h)m,ms for m — 
0.12 as a function of mg. We plot the results for several samples of an L = 24 system at 
/S = 0.625. The curves are cubic splines interpolated from 33 simulated points. We note 
that the errors are very small (invisible at this scale), so the fluctuations of the curves are 

not artifacts of the interpolation scheme. Bottom: Disorder-averaged {hs),^^^ for the same 
smooth magnetisation and temperature as the left panel, for all our system sizes. The plot 
shows the different behaviour of the regions inside the gap, where the staggered magnetic 
field goes to zero as L increases, and outside it, where there is a non-zero enveloping curve. 
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Figure 8.3: Le/f; Sample-to-sample fluctuations in the staggered component of the tethered 
magnetic field, Eq. ( 8.ii| , against system size. We show two sets of points, both at /3 = 0.625 
and m = 0.12, but for different values of ths- The red curve, corresponding to ths = 0.7, 



is right inside the gap defined by the outermost zeros in Figure 8.2 while the blue curve 
{rhs = 1.1) is outside. Both are shown to decay with a power law. Right: Disorder average 

{hs)mms systems of the left panel. Inside the gap, the field goes to zero with L; 

outside it has a finite limit. 



panel of Figure 8.2 we see that inside the gap the tethered magnetic field itself, not 



only its fluctuations, goes to zero as L increases. In fact, as shown in Figure 8.3 

for mg = 0.7, 
tuations R 



while {h 



m,ms 



L This means that the relative fluc- 



, Eq. (2.41), do not decrease with increasing L. For the point outside 
the gap, however, the disorder average of the tethered magnetic field reaches a 
plateau. Furthermore, the fluctuations decay with c = 2.94(7), which is compatible 



with the behaviour ^ L ^ of a strongly self-averaging system (in the language 



2.2.2 



of Section 

In physical terms, this analysis means that the saddle point defined by a small, 
but non-zero, value of the applied staggered magnetic field h^ would be self- 
averaging. We can now recall the well-known recipe for dealing with spontaneous 
symmetry breaking (see Section 54^: consider a small applied field and take the 
thermodynamical limit before making the field go to zero. Translated to the DAFF, 



this means that we should solve the saddle-point equations (8.8) on average, rather 
than sample by sample, and then take the — > limit on the results. 



do 



N 



dm 



m,ms 



N 



dm. 



s/ m,ths 



0+. 



(8.13) 



In other words, we are considering the disorder average of a thermodynamical 
potential, Djv, different from the free energy. This approach was first introduced 
in [|fero8[ , in a microcanonical context (the averaged potential was in that case the 
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Figure 8.4: Disorder-averaged pdf of rhg conditioned to m = 0.12 for the system of Fig- 



ure 8.2 Compare the simple structure of this graph with the bottom panel of Figure 8.1 



entropy). 

In order to understand the limit — > 0"*", it is convenient to keep in mind the 
analysis of the p(m) for the Ising model in Sections 5.3 and 5.4.1 A small but 



positive hs exponentially suppresses the negative staggered magnetisation region 
and, as its value is decreased, it is equivalent to considering a 'smeared' saddle 
point and averaging over the whole sector with mg > 1/2. A small but negative 
hs would have the corresponding effect in the rhs < 1/2 region. Since most of the 
interesting disorder-averaged observables are symmetric with respect to ms = 1/2, 
we can gain statistics by averaging over the whole rfis range. 

By doing this for fixed m = niQ we obtain the probability distribution of 7%, 
conditioned to m = rfiQ, which we will denote by p(ms|^o) (Figure 84]). This 
probability density function can be used to average over rhs for fixed m. 



dfhs p{ms\m) (O) 



m,ms 



dm, (O) 



-NQj^imslm) 



where 



an 



dm. 



m,ms 



(8.14) 



(8.15) 



The zero in ^^(mslm) is chosen so that p(ms|m) is normalised (so 0]sj{ms\m) dif- 
fers from the two-dimensional r}^/(m, mg) in a constant term). In practice, since 
the p{ms\m) is not symmetric sample by sample, but only for the disorder aver- 
age, considering the whole rhs range for A/'gampies disorder realisations is roughly 
equivalent to considering only the positive sector for 2A/'samplesQ 



^Compare with Section 5.3 where the average for ±h netted a very small error reduction for 
even observables, due to the correlation in the data. 
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With this process we have integrated out the dependence on mg- We now have 
a series of smooth functions (0)^^, together with a smooth one-to-one fu nction 
h{m) = {b)^^/(i. Recalling our analysis of ensemble equivalence in Section 54.1 
we see that {0)^(j^-^ and {0){h) are simply two quantities that tend to the same ther- 

modynamical limit. Furthermore, for finite lattices the tethered definition (O)^ is 
better behaved statistically and arguably more faithful to the physics of an experi- 
mental sample. Therefore, we shall make the identification 



{0){h) = {0)^, (8.16) 
exact in the thermodynamical limit, even for finite lattices. 



8.2 

Our tethered simulations 

We can infer several useful conclusions from the analysis of the previous section 

• The disorder average should be performed on the tethered observables, before 
computing the effective potential. 

• It is best to analyse several values of rh separately, since the average over 
rhs for each fixed m can be unambiguously related to the canonical average 

{0){h) via h(m) = (fc)^//3. In this way, we can study the phase transition 
that arises by varying the applied magnetic field at fixed fi. 

• For fixed m the conditioned probability p{ms\m) has two narrow, symmetric 
peaks, separated by a region with extremely low probability. 

Therefore, we have carried out the following steps 

1. Select an appropriate grid of m values. This should be wide enough to include 
the critical point for the simulation temperature, and fine enough to detect the 
fluctuations of (O)^. These turn out to be very smooth functions of rh, so a 
few values of this parameter suffice, as we shall see. 

2. For each value of rh, select an appropriate grid of mg. We start with evenly 
spaced points and after a first analysis add more values of rhg in the neigh- 
bourhood of the saddle points, as this is the more delicate and relevant region. 



In contrast with our study for the Ising model (Section C.2I, the steepness of 



the peaks makes this second step crucial here, even for small lattices. 

The simulations for each (rh,rhs) are carried out with the Metropolis update 



scheme of section 3.2.1 In addition, we use parallel tempering (cf. Ap- 



pendix |A]). This is not needed in order to thermalise the system for L < 32, 
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L 


samples 


A/V 










8 


1000 


20 


5 


31 


7.7 X 10^ 


7.7 X 10^ 


12 


1000 


20 


5 


35 


7.7 X 10^ 


7.7 X 10^ 


16 


1000 


20 


5 


35 


7.7 X 10^ 


7.7 X 10^ 


24 


1000 


40 


5 


33 


7.7 X 10^ 


9.3 X 10^ 


32 


700 


40 


4 


25 


1.5 X 10^ 


5.5 X 10^ 



Table 8.i: Parameters of our simulations. For each of the A4ampies disorder realisations 
for each L, we run A/^, x Arfi^. tethered simulations with temperature parallel tempering. 
The N't participating temperatures are evenly spaced in the interval [1.6,2.575]. For each 
size we report the minimum number of Monte Carlo steps (Metropolis sweep + parallel 
tempering). After the application of our thermalisation criteria, some of the simulations 
for L > 24 needed to be extended, leading to a higher average number of Monte Carlo 
steps (ACc)- 



but it is convenient since we also study the temperature dependence of some 
observables. Furthermore, the use of parallel-tempering provides a reliable 
thermalisation check (Section A.3.1). Section [C.4| in Appendix [C| contains 
some information about our numerical implementation of TMC for the DAFF. 



From Figure 8.4 we see that, if we only want to compute the {0)(h), the tethered 
simulations for mg away from the peaks are worthless, since their contribution 
to the average over rhs is exponentially suppressed by a huge factor. We could 
have run tethered simulations only in a narrow range of 7% around the peaks, 
as we did in Section 



5.4.1 



As we shall see, however, some physically relevant 
quantities require that the whole range be explored, the most conspicuous being 
the hyperscaling violations exponent 9. 



The parameters of our simulations are presented in Table 8.1 The table lists the 
number J\ffh of values in our m grid and the number of points in the nis grid for 
each, so the total number of tethered simulations for each sample is Mm x Mms- 

The number of Monte Carlo steps in each tethered simulation is adapted to the 
autocorrelation time (see next section), the table lists the minimum length 
and the average length M^^^ for each lattice size. 



8.2.1 Thermalisation and metastability 

We have assessed the thermalisation of our simulations through the autocorrelation 
times of the system, computed from the analysis of the parallel tempering dynam- 
ics (see Section A.j.i^ We require a simulation time longer than lOOT^ntj^ This 
process is only followed for L > 24. For smaller sizes we have simply made the 
minimum simulation time large enough to thermalise all samples. 



^For most simulations Tint — Texp, so this value is much larger than what is required to achieve 
thermalisation. This ample choice of minimum simulation time protects us from the few cases 



where Texp is noticeably larger than Tint- Cf. Section A.3.3.2 
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Figure 8.5: Histograms of thermalisation times for our L = 32 simulations, at two values 
of thg for m = 0.12. Notice the logarithmic scale in the horizontal axis, which is in units of 
300 parallel-tempering steps. The saddle point (the peak in the probability distribution) at 
this m = 0.12 corresponds to fhs = 1.078. Notice that we cannot measure times shorter than 
our measuring frequency of 300 parallel-tempering steps, so the first bin should be taken 
as encompassing all shorter autocorrelation times. Only the samples with t > 2^ have to 
be extended from the minimum simulation time. 




Figure 8.6: Same as Fig. 8.5 but now we consider several values of m for nig = 0.8 (in the 
hard thermalisation region far from the peak). The points for m = 0.12, closer to the critical 
point, exhibit a heavier long-time tail (mind the vertical logarithmic scale). 
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The distribution of correlation times for our different samples turns out to be 
dependent on the value of (m, mg). Considering first the variation of the average 
Tint with fhs at fixed rn, we see that the peak of the p{ms\rn) and its adjoining 



region are much easier to thermalise (Figure 8.5). This region coincides with the 



only points that have a non-negligible probability density (Figure 8.4), i.e., the only 
points that contribute to the computation of the (0)„»j. This fact suggests a possible 



optimisation, that we will discuss in Section 8.7 



A second interesting result comes from studying the evolution of the T[„t with 
m. Figure 8.6 represents the histogram of autocorrelation times for rhg = 0.8 (in 
the 'hard' region) for two values of m. The distribution for m = 0.12 has a much 
heavier tail. As we shall see, this is due to the onset of a phase transition. 

The difficulty in thermalising some samples stems from the coexistence of sev- 
eral metastable states, even for fixed (m, mg). In Figure 8.7 we represent the time 
evolution of the energy u for several temperatures of the same sample (L = 32, 
m = 0.12, rfis = 0.8). As we can see, for a narrow temperature range several 
metastable states compete. This has a very damaging effect on the parallel temper- 
ing dynamics, whose flow is hindered whenever a configuration that is metastable 



for one temperature is very improbable in the next (see Figure 8.8 ). 

For L = 32, some point^ presented a metastability so severe that enforcing a 
simulation time longer than lOOZint would require a simulation of more than 10*^ 
parallel-tempering updates (one thousand times longer than our minimum simu- 
lation of ^ 10^ steps). Thermalising these points (which constitute about 0.1% of 
the total) would have thus required some 10^ extra CPU hours, with a wall-clock 
of many months. We considered this to be disproportionate to their physical rele- 
vance (they are all restricted t o a region far from the peaks where the probability 
density is < 10~**^, see Figure 8.4). Therefore, we have stopped these simulations 
at about lOzint- This is still a more demanding thermalisation criterion than is 
usual for disordered systems and does not introduce any measurable bias in the 
physically relevant disorder-averaged observables. This can be checked in several 
ways: 

• First of all, as we have already discussed, these points are restricted to a region 
in rhs with probability density of at most 10"^*^. Therefore, even if there were 
a bias it would not have any effect in the computation of canonical averages. 

• Even at the most difficult values of {m,rh^) the log2-binning plot (the only 
thermalisation test typically used in disordered-systems simulations, see Sec- 
tion A. 3) presents many logarithmic bins of stability (Figure 8.9 blue bars). 



Even if we subtract the result of the last bin from the others, taking into 



account statistical correlations (see Section |A.3| and jFER07] ), several bins of 
stability remain (Figure 8.9 red bars). This is a very strict test, and one that 



even goes beyond physical relevance (because it reduces the errors dramati- 
cally from those given in the final results). 



3By 'point' we mean any of the A^amples x J^m^ individual tethered simulations for each L. 
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Figure 8.7: Time evolution (in number of parallel tempering steps) for the energy density u 
for the same sample of an L = 32 system at m = 0.12, rhg = 0.7 and different temperatures. 
For most temperatures the equilibrium value is quickly reached, but for a very narrow 
temperature range there are several competing metastable states (bottom-left panel for 
T = 1.901). 




Simulation time 

Figure 8.8: Temperature random walk for one of the 40 replicated systems for the parallel 



tempering simulation of the sample in Figure 8.7 The flow is blocked at the same tem- 
perature that had several metastable states (marked with a horizontal line), see bottom-left 



panel in Fig. 8.7 



126 



The DAFF in the tethered formalism 



0.0003 



0.0002 



0.0001 







3 

Bin 



Figure 8.9: Time evolution of (5s)m)n. ^ — 0.12, = 0.5, in a logarithmic scale (the 
first bin is the average over the second half of the simulation, the second bin the average 
over the second quarter and so on, cf. Section A.3I. This is in the middle of the {m,m^) 
region where thermalisation is hardest. The blue bars are the actual errors of our results 
and the red bars mark the error in the correlated difference of each bin with the first. Even 
for these reduced errors, we have several bins of convergence. 



8.3 

The effective potential 



As we discussed in detail in Section 8.1.1 we can learn much about the physics of 



the DAFF by studying the disorder-averaged saddle-point equations (8.13), which 
amounts to studying the disorder-averaged tethered field. To this end, we have 



plotted both components of the tethered field in Figure 8.10 Let us first consider 
the staggered component. In the region we are considering, the corresponding 
equation has three solutions for each m: one in the paramagnetic region at = 0.5 

and two symmetric ones that get closer together as we lower m 4] Notice that (&)^ 
is negative, therefore when we lower m we are actually tncreasmg \m\^ 

The other equation, {b)jf^^^ = /3/i, has two symmetrical solutions for each m 
(for values of h in the range). The resulting structure of the effective potential 

is better understood with a two-dimensional plot of B = — l^h, {h)^^^^^^, 



4The tethered field is actually not exactly symmetric in wis, there are deviations in the tails due 
to the way we have added the Gaussian demons when constructing in 1 8.2a I. However, we can 
consider the peak positions symmetric with great precision. 

5The minus sign is an awkward remnant from the sign convention for the tethered field used 
in [FER09C], but inverted in [feriib] after, alas, having finished the simulations. We would obviously 
have obtained a positive range of h if we had simulated the m region for the opposite range of m. 
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Figure 8.10: Left: Staggered component of the tethered field as a function of mg for all the 
values of m in our L = 32 simulations. Right: Same plot for the regular component of the 
tethered field. 



-(h) 

which is the gradient of Oj^ {m.,rns) = ri^{m,ms) — ^hm. We have plotted this 



vector in Figure 8.11 for = — 1.33|^ There are three stationary points (circles in 
the figure): 

• Two in the antiferromagnetic region, symmetric with respect to rfig and at the 
same rit. These are minima of the effective potential and, therefore, maxima 



of the probability density p{rh,ms;h) (cf. Figure 8.12). 



• One in the paramagnetic region, at rhs = 0.5 and at a different value of m from 
the other two. This is a saddle point, it is a maximum in the rfis direction and 
a minimum in the m direction (by symmetry considerations, these are its 
principal directions). 

We can use this analysis to make a first qualitative assessment of the order of the 
DAFF's phase transition. Indeed, one of the most important features of a first-order 
transition is the emergence of metastability. The system, close to the transition 
point, has two metastable states, one ordered and the other disordered, charac- 
terised by peaks in the p{m,in^;h). As we modify the magnetic field, the relative 
weight of the peaks changes, until, at the transition point p{nf'^^^'^^^, m°^'^^^^'^; he) = 
pl^j^disordered j^disordered. ^^-j This is equivalent to the Maxwell construction in a 
microcanonical setting. However, in our analysis we have found no peak in the 
paramagnetic region, which would rule out the first-order scenario. 

How, then, can we explain the metastable behaviour observed in previous work? 
Actually, our preliminary study of the DAFF in the canonical ensemble in Sec- 
tion [73] had the answer: the observed metastability is caused by the two coexisting 
antiferromagnetic peaks. 



Let us consider Cl{m^\m) in Figure 8.12 In a canonical simulation the system 



would spend most of its time in one of the two antiferromagnetic minima, only 

^This choice seems arbitrary, but we shall see later that it corresponds to the critical h for this 
temperature. 
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Figure 8.ii: Plot of B = ((&),^ - ^h, {bs)yf,^^J = Vn^^) for = -1.33. There are three 
stationary points (circles): two symmetric antiferromagnetic minima at ?« ~ 0.12 and a 
paramagnetic saddle point at m ?a 0.093. The dashed arrow marks the integration path 



that we will use in Section 8.5.2 
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Figure 8.12: Effective potential nj^{7ns\m = 0.12), Eq. (8.15 1. The errors can hardly be seen 
at this scale (they are smaller than the width of the lines for the larger sizes). 
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very rarely tunnelling from one to the other. Notice that, even though the barrier 
AD]v decreases with L, the escape time actually goes as exp[NAnN], thus causing 
an exponential critical slowing down (cf. Figure P4I showing exp[NADN] ^ 10'^*^ 
for an L = 32 system). The precise behaviour of these free-energy barriers is of 
paramount importance to a characterisation of the physics and we shall dedicate 
Section |8.5.2| to its investigation. 



8.4 

Finite-size scaling study of the phase transition 

The study of the effective potential in the previous section found no evidence of 
first-order behaviour in the DAFF, suggesting that its phase transition is continu- 
ous. Of course, promoting this statement to more than a working hypothesis would 
be premature, for several reasons. Not the least of which is that we have not yet 
shown that there actually is a phase transition in the studied (m, ms) region. In 
the remaining sections we shall both find this phase transition and characterise its 
properties. We shall first work at fixed /3 = 0.625 and find the phase transition that 
appears when varying h and we shall then examine the temperature behaviour at 
fixed h. 



84.1 Scale invariance 

One of the clearest signals of a continuous phase transition is the presence of scale 
invariance in the system. Therefore, our first step is computing the correlation 
length. 

Recalling definition ( 7.9 1 of the staggered Fourier transform we define the fol- 
lowing propagator 

F,nik) = N {cp{k)cp{-k)) (8.17) 
which we use to define the second-moment correlation length (explained in Sec- 



tion |5]i]) 



2sin(7t/L 



m{h) 



(0) 



1 1/2 



1 



m-{}i) ' 



(8.18) 



Notice that we write ^li^) instead of ^2{rh), following our discussion of ensemble 



equivalence in Section 8.1.1 



The result of this computation is plotted in Figure 8.13 In it we can see a 



clear occurrence of scale invariance, signalling the onset of a second-order phase 
transition at ~ —2.13. As a welcome bonus, the dependence of ^ on is linear, 
which will simplify the analysis. 

We can obtain a complementary picture by fixing the magnetic field and study- 
ing the temperature dependence. In Figure 8.14 we show this for h = —2.13. For 
each temperature and lattice, ^{h = —2.13) is obtained by linear interpolation in 
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Figure 8.13: Correlation length in units of the system size as a function of the external 
magnetic field h, for j6 = 0.625. The clear crossings at /z ~ —2.13 signal a second-order 
phase transition. 




Figure 8.14: Plot of ^/L as a function of the inverse temperature /5 for h = —2.13. This is 
the approximate value of the critical field at j6 = 0.625 (cf. Figure 8.13! ^ which explains the 
convergence at the end of our temperature range. 
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a plot analogous to Figure 8.13 This value of h causes the critical temperature to 
appear at the very end of our simulated range. The reason for this choice is that 
the rhg grid for each m was optimised for /3 = 0.625. As we move away from this 
temperature the peaks are only irregularly sampled, which causes larger statistical 
errors and potentially discretisation biases. 



8.5 

Computation of critical exponents 

From the previous plots we can perform a Finite-Size Scaling | iAMi03[ , [zj03| analysis 



we 



in order to determine the critical exponents. As we discussed in Section 7.2.1 
need three independent critical exponents in order fully to characterise the phase 
transition of the DAFF, which we can choose as v, f], 9. In the following sections we 
shall compute these three exponents and then use additional observables to check 
the scaling and hyperscaling relations. 



We shall apply the quotients method | |bal96[ , which is based on Nightingale's 



phenomenological renormalisation jNiG73] . Let us consider a system that experi- 
ences a second-order phase transition at a critical point Xc (where the parameter 
X is, in our case, either the applied magnetic field h or the temperature). If some 
physical observable O behaves as (X — Xc)~J^" = x~3/£) near the transition point, then 



we expect the following dependence on a finite lattice (cf. Section 2.1.3 neglecting 
corrections to scaling, 

0(L,x) = LV^^^^foixL^^'^). (8.19) 

In this formula Vx is the exponent of the correlation length, ^ oc and fo is a 
smooth universal scaling function. 

In order to define the quotients method, we write ( 8.19[ > in the following form 



0(L,x) = Ly"'"^Fo{aUx)/L). (8.20) 

Now, for a pair of lattice sizes L\ and L2 = sL\ we identify the single value of x 
such that ^(Li,x*)/Li = ^(L2,x*)/L2. At this point, we have 

0(L,x*) • ^ ' 



In Table 8.2 we have applied this method to several observables, using s = 2, 

— > yo = Vx + l, (8.22) 
Ml y^ = j = 2^-3vx, (8.23) 

dxM — > xfo = a. (8.24) 

In the following sections we shall analyse each critical exponent separately and 
compare the results with previous numerical and experimental work and with 
analytical conjectures. 
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L 


h*iL) 






a/vf, 




8 


-2.178(4) 


0.0125(7) 


0.887(5) 


0.0765(25) 


1.07[7](5) 


12 


-2.140(5) 


0.0104(5) 


0.790(9) 


0.0781(27) 


1.01[3](3) 


16 


-2.123(3) 


0.0119(4) 


0.742(7) 


0.224(4) 


1.10[13](7) 



Table 8.2: Computation of the critical exponents with the quotients method, applied to 
pairs of lattices (L, 2L). The first four columns give results from the quotients at h*{L) for 
fixed j6 = 0.625. The last column gives results for j6*(L) at fixed h = —2.13. For this last 
value we give two error bars, distinguishing systematic and statistical errors. 



8.5.1 Exponents v and jS 

As a first application of the quotients method, we can compute the correlation 
length critical exponent, using equations (8.21) and ( [8.22 1. This exponent can be 



computed either from d^^ (Figure 8.13 1 or from d^^ (Figure 8.14). 

In the first case, since the h dependence is linear in all our simulated range, we 
simply fit ^{h) to a straight line and approximate 9^^ with its slope (as usual, we 
perform a different fit for each jackknife block) . 

As to the temperature dependence, the analysis is more delicate for two reasons. 
The first is that ^(jS) is no longer a straight line, the smaller sizes showing a clear 
curvature even very close to the intersection point. The second is that, as discussed 
above, the value of the correlation length for temperatures other than j6 = 0.625 
has some chaotic fluctuations due to discretisation error. Therefore, even if the 
data are correlated (as is the case for parallel tempering simulations), computing 
the temperature derivative is difficult. 

Our solution has been to fit the high-/3 region to a quadratic polynomial for 
L < 16 and to a straight line for L > 24. The aforementioned systematic effects 
cause the derivative to depend heavily on the fitting range. We have accounted for 



this effect in Table 8.2 by giving a second error bar, in square brackets. 

Notice that exponents v/j and should coincide, but we obtain Vf^ ^ 0.75 and 
~ 1.05. This discrepancy is mainly due to large corrections to scaling. In gen- 
eral, as discussed in the introduction to this chapter, attempts to determine v have 



yielded values in a very broad range. We can see the reason in Figure 8.15 where 
we show scaling plots of ^(/3) for values of v in a very broad range. For each v 
there is a temperature range that shows a seemingly good scaling. Therefore, at- 
tempting to estimate v as the exponent that produces the best collapse, a method 
frequently employed for this and other models, is not only imprecise but also dan- 
gerous. By precisely locating the critical point and using the quotients method we 
minimise the effect of scaling corrections (but we do not eliminate them completely, 
as evinced by the incompatible values of v/, and v^). 

Notwithstanding these difficulties, notice that our range of values for v is in 
good agreement to that defined by the best experimental estimates: from v = 
0.87(7) |sLA99 a to V = 1.20(5) |jYE04|. 

We can determine the critical exponent 7/1/ by applying the quotients method 
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Figure 8.15: Scaling plots of ^/L for several values of the thermal critical exponent v. A 
very high value of v causes a misleading collapse for low L, while our preferred values 
(bottom panels) are better at collapsing the larger sizes close to the intersection points. 



to {Mj){h). The results are quoted in Table 
hyperscaling relations (Section 7.2. i[ >, 7/v = 4 



8.2 According to the scaling and 
- f] = 3 — 2/3/v. Our resulting value 
of /3/v is very low, but different from zero. Recall that this exponent controls the 
evolution of the peaks in p{m^;h) (cf. Sections 5.2.2 and 6.4). In Figure 8.16 we can 



see that, indeed, the zero in (fcs)m shifts slowly but surely with increasing L. The 
figure is for fixed m = 0.12 fh{hc). 

The extremely small value of (i/v is one of the reasons for past claims of a 
first-order phase transition and is consistent with previous numerical studies. 



8.5.2 The hyperscaling violations exponent 6 

The third and last independent critical exponent is 9, which gives a measure of the 
violations to hyperscaling, Eq. ( [7.17 1. This exponent cannot be obtained from the 
canonical averages of physical observables at he, as we did for /3 and v. Rather, 
its computation is an intrinsically tethered operation. Following Vink et al. [iviNioi 
iFisii ], we can relate 6 to the free-energy barrier between the two coexisting states 
at the critical point, 

Af N oc L^-°, (8.25) 
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Figure 8.16: Plot of the staggered component of the tethered magnetic field, {hs)^,^^ for 
m = 0.12 m(h(.) at /3 = 0.625. This is a close-up of Figure 8.2 — ^bottom, restricting the m 
range and eliminating the curve for L = 8 in order better to see t he L-e volution. The area 
under the curve between the two zeros gives the quantity Af j, Eq. (8.29 1, which we can use 
to compute 9. Notice the slow shift inwards of the antiferromagnetic zero, which marks 
the position of the peak in p{rhs\m). 



in this equation f ^ is the free-energy density, as in equation (2.10). This is a spe- 
cially interesting quantity to measure, since in a first-order transition 9 > D — 1. 
Indeed, in a first order scenario the system would tunnel between the two pure 
phases (paramagnetic and antiferromagnetic) by building an interface. The free- 
energy barrier would then be associated to a surface tension. This in turn would 
be proportional to the dimension of the interface's surface, which must be at least 
D-1. 

We first notice that AFjv is defined as the free-energy barrier between the two 
phases (disordered and antiferromagnetic) that form at the critical point (/3,/Zc). 



Recalling our study of section 8.3 we see that this is equivalent to computing the 
AOf^ between the paramagnetic saddle point and one of the antiferromagnetic 
minima. We denote these two points by (m^^\ms^'^) and {m^^\mi^''), respectively, 

AdJ;) = n^^\m('\4'^)-n^^\m(^\mP). (8.26) 
As always, this potential difference is simply the line integral of 



B={{b)„y^^-fiK {bs)n,,n,J (8.27) 



along any path connecting (m^^\mi^^) and {m^^\m 



(2)^ 
S , 
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Figure 8.17: Tethered average ^^=05 lattice sizes at /3 = 0.625. 

In our simulations we have not covered the (m, ms) plane uniformly. Rather, we 
have simulated several slices with fixed m. Therefore, the most convenient way to 

evaluate AOj^ is 

1. At m = 0.12 rtic = rn{hc), compute the potential difference between 
(mc,m*) and (mc,0.5), where m* is the position of the peak of p{nis\mc) No- 
tice that, since we are working at constant m, 



n^^\rhc,fhs = 0.5) - n^J^\fhc,ml] 
This is just the integral 

AFi = Ojsi{0.5\inc) — ON{m*\mc) = 



ON{0.5\mc) — Of^{ml\mc)- (8.28) 



Bdii 



0.5 



dmg {h 



(8.29) 



In other words, AFi is just the area under the curve in Figure 8.16 We can 
average AFi for the symmetric path in the region with negative staggered 
magnetisation (mg < 1/2). 

2. The tethered average at (mc, m*) defines a value of fihc through the saddle- 
point equation, 

'W)m,,nrt = /^^c • (8.30) 

3. Consider the tethered values (^)^ ^^=0 5 a function of m and interpolate the 
value in* that satisfies 



(8.31) 
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L 




Fit range 





X^/d.o.f. 


8 


0.03382(29) 


L > 8 


1.448(9) 


5.56/3 


12 


0.01756(15) 


L > 12 


1.469(13) 


0.44/2 


16 


0.01138(9) 


L > 16 


1.461(20) 


0.16/1 


24 


0.00608(5) 








32 


0.00392(5) 









Table 8.3: Computation of the hyperscaling violations exponent 9 from the free-energy 
barriers AF^. We report fits to (8.34 1, for different ranges. Our preferred final estimate is 
9 = 1.469(20), taking the central value of the fit for L > 12 and the more conservative error 
of the fit for L > 16. 



This turns out to be very easy to do because the (fc),^ ms=0 5 ^^^^ ^ straight 
line (Figure [8. 17[ >. 

4. Compute the potential difference between (mc, ms = 0.5) and (m*,ms = 0.5). 

B.dl2= dm ((&)^,^3=o.5-iSM- (8-32) 

Jnic 

5. The sum of AFi and AF2 is equal to the integral of B along a path joining the 

two saddle points {m^^\mi^'') = (m*,0.5) and {m^^\m''p) = (mem*), i.e., 
the free-energy barrier we were looking for, 

AFn = AFi + AF2. (8.33) 



This integration path is depicted with a dashed line in Figure 8.11 on page 128 



Notice that, in accordance with the study of Section 8.3 AFi will be positive 
and AF2 negative. 



We have implemented this procedure for all our lattices. Table 8.3 shows the 
resulting AF^ and fits to 

AFn = AL3-^ (8.34) 



As our preferred final result we can give the central value of the fit for L > 12 
with the error of the fit for L > 16. Therefore 

e = 1.469(20). (8.35) 

Notice that 6 is close to the value 9 = D/2 that one would expect from a naive 
Imry-Ma argument | ,imr73] . In particular, 9 < D — 1, and therefore the metastable 
states do not define stable phases, which is yet another argument against a first- 
order transition. 

Notice that the value of > is also the reason for the exponential slowing 
down of this model. Indeed, we saw in Section |8.3| that the system gets trapped 
in local minima with escape times T ^ exp[NAFjvJ ^ exp[L^] causing a thermally 
activated critical slowing down |Ifis86a|,[nat98[. 



8.5 — Computation of critical exponents 



137 



8.5.3 Scaling relations: the specific heat and a 

As discussed in the introduction to this chapter, there has long been disagreement 
as to whether the specific heat in the DAFF is divergent, as observed in experi- 
ments | bel83[ BEL98I. The specific heat critical exponent is generally difficult to 



obtain in numerical simulations and the self-averaging violations in the DAFF have 
not made things any easier liHAROn |wuo6l Imalo6L 

Since we already have a complete set of independent critical exponents, we can 
obtain a. from the hyperscaling relation ( |7.i7[ >. Using our range of values 0.75 < 
V < 1.1 and 6 = 1.469(20), we have the following bounds for a, 

0.32 < a < 0.85. (8.36) 

Clearly, the uncertainty in v does not allow a good determination, yet we can safely 
exclude a non-divergent specific heat from our data (a < would imply v > 1.3). 
Furthermore, even the logarithmic divergence {a = 0) suggested by experimental 
work seems excluded. Notice, however, that the experimental values of v together 
with hyperscaling relations, also imply a. > 0. For instance, the result v = 0.87(7) 
of |5lA99] , taken with our 9 = 1.469(20), would give a. ^ 0.67(10). Even the higher 
value V = 1.20(5) of [yeo4| would give a ^ 0.16(8). 

We can attempt a direct determination of a from our simulations. We start by 
defining the specific heat as 



C^^ (B.3., 

In principle, at the critical field, C cx L^^^, so we could compute a/v from C using 
the quotients method. Unfortunately, the quotients method is ill-suited to this 
quantity, which features a large analytical background, so its scaling is more aptly 
described by C ~ A + B'^^'^ [BAL98J. Therefore, one needs extremely large values of 
L to reach the asymptotic regime where C ~ L*/^. 



The behaviour of the quotients shown in Table 8.2 is consistent with the above 



expectation. Our results point to a divergent specific heat, but our estimates for 



the exponent a are clearly still very far from the asymptotic regime where (7.17) 
would be satisfied. An additional source of error is that our determination of the 
specific heat suffers from systematic effects due to the discretisation of rh. Indeed, 
while our m grid was more than adequate for a determination of 9m ^, which could 
be approximated by a straight line, the specific heat shows a clear curvature as a 
function of m. We have interpolated it with a quadratic polynomial in order to 
estimate the derivative, but we would have needed more resolution in m to make a 
safe computation. 

8.5.4 The two-exponent scenario and the experimental scattering 
line shape 

We can close our FSS analysis of the phase transition by considering the two- 



exponent scenario discussed in Section 7.2.1 Recall that according to this proposal. 
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6 = D/2 — f)lv. From (8.35) and /3/v = 0.011 1(8) (combining the last two rows of 
Table 8^ we have 



Q = 1.469(20), (8.38) 
D/2 - j6/i/ = 1.4889(8). (8.39) 

The two numbers are compatible. 

We can use our results for the critical exponents to shed some light on the exper- 
imental situation. In an experimental study, the critical exponents are computed 
from fits to the scattering line shape or structure factor S{k). We can write this 
quantity as 

S{k) = Sd{k) + Sc{k), (8.40) 



where and Sc are the connected and disconnected propagators defined in { 7.2^} . 
Recall that these diverge as 

Sd(fc)-fc-'+'? + ..., (8.41) 

Sc(fc)-fc-2+'? + ..., (8.42) 

where the dots represent subdominant terms. 

An experimental study of S{k) faces, then, the considerable challenge of distin- 
guishing two different divergences at A: = 0. One possibility is simply to disregard 
the difference and parameterise S{k) with an effective anomalous dimension, 

S{k) - fc-2+'7eff. (8.43) 

This procedure was employed in [¥£04], obtaining v = 1.20(5) and >/eff = —0.51(5). 
An alternative way, supported by our results, is to adopt the two-exponent 



ansatz (7.19 1, which can be rewritten as 2r] = rj. Then, the following approximation 

Sd(fc) = [Sc{k)f. (8.44) 



can be made: 



This avenue was followed in jsLA99| , yielding v = 0.87(7) and t] = 0.16(6). This 
value of V is right in the middle of our numerical bounds. However, taking into 
account ( 7.19} , we would have 



e = 2-?/ = 1.84(6), (8.45) 

which is clearly incompatible with our results. 

The answer to this problem is that the approximation (8.44 1 is probably too 
naive. Even accepting that f] = 2r], approximating all of S^, and not just its most 
divergent term, as the square of Sc is an excessive simplification. Clearly, a better 
theoretical parameterisation of S{k) is needed. Our methods are well suited to a 
direct numerical approach to this question, left for future work. 
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Figure 8.18: Equilibrium configuration for an L = 24 system at /5 = 0.625, m = 0.12, 
rhs = 0.5. 



8.6 

Geometrical study of the critical configurations 

The picture painted in the previous sections is that of a second-order transition; but 
one with quite extreme behaviour. Among its peculiarities we can cite an extremely 
small value of /3 and large free-energy barriers, features both that are reminiscent 
of first-order behaviour. The free-energy barriers in the DAFF, however, diverge 
too slowly to be associated to the surface tension of well-defined, stable coexisting 
phases. But this only raises the question of what kind of configurations can give 
rise to such a behaviour. 

In this section we study the geometrical properties of the minimal-cost spin 
configurations joining the two ordered phases at the critical point. To this end, we 
consider simulations at j6 = 0.625, m = 0.12 ~ m{hc) and rtis = 0.5. Recalling 
that rhs — + 1/2, this last condition expresses the fact that we are studying 
configurations with no global staggered magnetisation. This is a good example of 
an 'inherently tethered' study, that examines information hidden from a canonical 
treatment (cf . Figure |8.4[ where we show that the region considered here has a 
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L 


samples 


-interface 


12 


3000 


160.29(25) 


16 


3000 


304.62(41) 


24 


3000 


755.74(94) 


32 


700 


1446.1(39) 



Table 8.4: Masses of the interfaces for our equilibrium configurations at /3 = 0.625, m = 
0.12 ^ nic, nis = 0.5. This mass grows as A/interface ^"^^ From a fit, c = 2.2402(24), which is 
incompatible with our estimate of 6 = 1.469(20), confirming that the free-energy barriers 
are not associated to surface tensions. 



canonical probability density of ^ 10 '^'^ for an L = 32 system) 



Figure 8.18 shows an example of such a configuration for an L = 24 system. In 
order to make the different phases clearer, we are not representing the spin field 
Sx, but the staggered field SxTCx- As is readily seen, even if the global magnetisation 
is mg ^ 0, the system is divided into two phases with opposite (staggered) spin. In 
geometrical terms, most of the occupied nodes of the system belong to one of two 
large clusters with opposite sign. All the other clusters are orders of magnitude 
smaller, with most of the remaining spins forming single-site clusters. 

At a first glance, this picture may seem consistent with a first-order scenario, 
where the system is divided into two strips whose surface tension gives rise to the 
free-energy barriers (see jMMO/] for an example of these geometrical transitions in a 
first-order setting). In order to test this possibility, we can study the evolution of the 
interface mass with the system size and compare it with the explicit computation 



of free-energy barriers done in Section 8.5.2 



Given a configuration, we first trace all the geometric antiferromagnetic clus- 
ters. We then identify the largest and second largest ones. Finally, we say that an 
occupied node belongs to the 'interface' if it belongs to the largest cluster and has 
at least one first neighbour belonging to the second largest one. We have computed 
in this way the interface A/interface for our 700 L = 32 samples and for 3000 samples 
for all our smaller systems (we have run additional simulations just at this point). 



Table 8.4 shows the result of this computation. A fit to 

A/interface = ^^'^/ (8-46) 

for L > 12 gives c = 2.2402(24) with x^/d.o.t = 3.62/2. If the metastability of 
the system were to be associated to a surface tension, then the free-energy barriers 



would have to scale with c. However, we saw in Section 8.5.2 that the actual scaling 
is NAf N oc with 6 = 1.469(20). 

To understand the reason why this large interfaces are not responsible for 
metastable signatures, we have to look more closely at the geometry of the con- 



figurations. Indeed, we see in Figure 8.18 that there is a path connecting the spins 



in the green strip across the red one (there is only one green strip, since we are 
considering periodic boundary conditions). If we were to consider a complete to- 
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Figure 8.19: Probability of completing a path with constant staggered spin across the 



strip with opposite staggered magnetisation, see Fig. 8.18 for our spin configurations at 
^ = 0.625, nis = 0.5. 



mography of this configuration, we would find several of these paths (which, of 
course, need not be contained in a plane). In other words, the phases are porous. 
This is in clear contrast to a first-order scenario in which the phases have essen- 
tially impenetrable walls. Now, this could be a peculiarity of the particular selected 
configuration. In order to make the analysis quantitative, we shall examine all of 
our samples and determine the strip-crossing probability P. This is defined as the 
probability of finding a complete path with constant staggered spin across the strip 
with opposite staggered magnetisation and we can compute it with the following 
algorithm: 



1. For each configuration, compute the staggered Fourier transform (7.9) of the 



spin field at k^^, for each of the three axes {(px, (py, (pz)- 

2. In a strip configuration, one of the (p will be much larger than the other two. 
Assume this is (px, so the strips are perpendicular to the OX axis. 

3. Measure the staggered magnetisation Mg on each of the planes with constant 
X and identify the plane x = x^ax with largest | Mg | . This plane will be at the 
core of one the strips. 

4. Trace all the clusters that contain at least one spin on the x = Xmax plane, but 
severing the links between planes x = Xmax and x = Xmax — 1- 

5. If any of the clusters reaches the plane x = Xmax — 1 there is at least a path 
through the strip with opposite magnetisation (the previous step has forced 
us to go the long way around, so we know we have crossed the strip). 
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We have plotted the strip-crossing probability P(m) in Figure 8.19 Notice that 
1 — P behaves as an order parameter. If we keep increasing m, so that we enter the 
ordered phase, the phases eventually become proper impenetrable strips, hence 
P = for large enough systems. On the other hand, for low m, in the disordered 
phase, the strips become increasingly porous, so that P = 1 in the limit of large 
systems. The crossover between these two regimes manifests in the crossing of the 
P(m; L) at the phase transition point m ~ 0.12. 



8.7 

Beyond L = 32: optimising TMC simulations 

The simulations listed in Table |8.i| required more than 700 years of total CPU time 
and were carried out using a combination of large supercomputing facilities {Mare 
Nostrum of the Red Espanola de Supercomputacion), large computing clusters {Ter- 
minus at BIFI) and grid resources (Piregrid). Even with the parallel scheme ex- 



plained in Section C.4 the wall-clock for some of the toughest points has been of 
several months. 

Essentially, by tethering the magnetisations we managed to remove the largest 
free-energy barriers of the systems. However, for large sizes, new barriers, associ- 
ated to different reaction coordinates, began to appear. In order to eliminate these, 
one would have to tether additional quantities. 

It may seem, then, that the methods described in this Chapter cannot be ex- 
tended to larger lattices with current hardware. Actually, these simulations were 
intended to explore the physics of the DAFF thoroughly, perhaps sacrificing fo- 
cus for breadth. In a more targeted study, however, we can take advantage of the 
model's characteristics to optimise the simulations. In particular, we can highlight 
two interesting facts: 

• Only a very narrow region around the saddle points has any significant 
weight for reconstructing canonical averages (Figure 8./[\ . 



It is much harder to equilibrate the region far from these peaks (Figure 8.5 1. 



Combining these two observations, it turns out that, if our only interest is recon- 
structing canonical averages, we can achieve a qualitative reduction in simulation 
time by simulating only a narrow range around the peak in p(ms|m) for each value 
of the smooth magnetisation m. In fact, all the results discussed in this chapter. 



except for Sections 8.5.2 and 8.6 could have been computed in this simplified fash- 
ion. 

We have demonstrated this optimisation by simulating 400 samples of an L = 48 
system for m = 0.12. We use only J\fm^ = 6 (but we have to increase the number of 
temperatures in the parallel-temperi ng to 60, in the same range, in order to keep 

the exchange acceptance high). Table 8.5 shows the value of rn^^^^ as a function of 
L. 
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We can use the values of ri^c^^^ reported in Table 8.5 to attempt an alternative de- 
termination of j6/v, following the method described in Section 5.2.2 In particular, 
we perform a fit to 



\m 



peak 



1/21 



This is plotted in Figure 



8.20 



The result of the fit, with p^^/d.o.f. = 2.98/3, is 
|6/v = 0.011 1(7). 



(8.47) 



(8.48) 



This result is compatible with our computation with the quotients method in Ta- 
ble 8.2 Notice, however, that it does not take into account the uncertainty in m^ (in 
Section 5.2.2 we worked at the exact j6c and even in Section 6.4 we knew j6c with 
many significant figures). 
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L 


-^samples 






8 


1000 X 2 


31 


0.58585(87) 


12 


1000 X 2 


35 


0.58239(54) 


16 


1000 X 2 


35 


0.581 54(36) 


24 


1000 X 2 


33 


0.57839(24) 


32 


700 X 2 


25 


0.576 72(20) 


48 


400 


6 


0.57491(33) 



Table 8.5: Value of the peak position for our different system sizes. The result for L = 48 is 
of comparable accuracy, despite being computed from only 6 tethered simulations around 
the saddle point. Recall that for L < 32 we can average over the positive and negative 
peaks, so the number of samples for these systems is effectively double the value shown in 
Table 10 
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Figure 8.20: Position of the peak against L from Table 8.5 with a fit to Eq. (8.471, giving 
/6/v = 0.0111(7). 
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The Edwards- Anderson spin glass 
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CHAPTER IX 



Spin glasses: an experimental challenge for 
theoretical physics 



During the 20th century, the experimental study of magnetic alloys gradually led to 
the identification of a class of materials that suffered a 'freezing transition' at some 
critical temperature Tc. In particular, if we denote by Sx the magnetic moment 
(spin) at site x in a lattic^ 

• Below Tc, each of the spins freezes in some orientation, {Sx)t 7^ 

• Though the freezing is collective, the spin orientations are random. In par- 
ticular, the average orientation of the spins is zero: jjJ2x{^x)t = 0- More 
generally, there is no long-range magnetic order of any kind 

^fc = ^Ee"''''(Sx>^ = 0, V/c. (9.1) 

X 

Notice that this expression includes both the ferromagnetic order parameter, 
k = 0, and the antiferromagnetic one, k = {n,n,n). 

• The dynamical evolution takes place at macroscopic times below T^. 

The position of the freezing temperature is typically signalled by the appearance 
of a cusp in the imaginary part x"{^) of the frequency-dependent magnetic sus- 
ceptibility, although it has been proved that this freezing corresponds to an actual 
phase transition (cf . Section 9.i.3[ >. 



These systems were soon seen to have many interesting physical properties, 
which in turn generated a great deal of theoretical and experimental interest. In 
the early 1970s, the concept of 'spin glass' was introduced, and applied to ran- 
dom, mixed-interacting magnetic systems that experience a random, yet coopera- 
tive, freezing of spins below some critical temperature. 



^In this section we consider the general case where the spins are three-dimensional vectors. 
^Here the (■ ■ ■)( denote an average over a very long measuring time. 
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The random and mixed nature of the magnetic interactions gives rise to frustra- 
tion (recall Figure 1.^) and this, in turn, generates a rugged free-energy landscape. 
Furthermore, the fact that spin glasses were particularly amenable to experimen- 
tal investigation and theoretical modelling (see below) quickly established them as 
quintessential examples of complex systems. 

In this chapter we shall begin by giving some brief notes on spin glasses from 
an experimental point of view in Section 9.1 This will also serve as an introduction 
to non-equilibrium physics and the concept of aging. We then discuss the generally 
accepted Edwards- Anderson model in Section 9.2 stating the main conclusions of 
the different (and incompatible) theoretical pictures that have been proposed for it. 
Finally, in Section 9.3 we motivate the numerical investigation of the problem that 
will be undertaken in Chapters 10 and 11 



9.1 

The experimental spin glass 

9.1.1 The canonical experimental spin glass: RKKY 

As we said above, in order to have a spin glass system we need at least disorder and 
a mixture of ferromagnetic and antiferromagnetic interactions between the spins. 
In principle, we would be looking for a system whose spin Hamiltonian consisted 
of exchange interactions: 

^xy ~ Jxy^x ' Sy, (9-^) 

where we need to have both positive and negative }xy- 

The classical solution to these requirements is the Ruderman-Kittel-Kasuya- 
Yosida (RKKY) interaction jRUD34[ [kAS36[ [y0537| . We consider a noble metal with 
magnetic impurities (for instance, CuMn, for copper with manganese impurities). 
Then, the magnetic moments scatter the conducting electrons, which gives rise to 
an indirect exchange interaction: 'Hx,x+r = }{^)Sx ■ Sx+r- For a large separation 
between impurities, we can write the coupling strength as 

(fcp is the Fermi momentum of the metal). Since the positions of the impurities 
or, in other words, the distances between the spins, are random, the oscillating 
nature of }{r) will generate precisely the random mixture of ferromagnetic and 
antiferromagnetic couplings that we need. 

9.1.2 Aging and other non-equilibrium phenomena 

The physical properties observed in spin-glass simulations are rich and surprising 
(see [,bin861[myd93| for reviews). The main feature is that, in general, the relaxation 
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T (K) 

Figure 9.1: Comparison of the zero-field-cooled (b and d curves) and field-cooled suscepti- 
bilities (fl and c) for two samples of CwMn with different concentration of impurities. Graph 
from [ iNAG73 t as quoted in IIbin86I . 



times below Tc are not only macroscopic, but large enough for the system to be 
out of equilibrium for the whole of the experiment. When the spin glass evolves at 
fixed Ti < Tc, some frozen or spin-glass phase slowly forms. In this situation, the 
system is said to age |jviN97|: the response to a field variation depends on the time 
spent at Ti. 

Let us see some examples of this. First we consider the experiment quoted in 
Figure |9.i[ In it, the authors measured the magnetisation M induced in a spin glass 
by a small applied field h and used it to define a dc susceptibility[3] ;\;dc = M/H. 
They then consider two experimental protocols. In the first one (field-cooling) they 
apply the field above Tc and cool the system down to Ti < Tc, without switching 
off the field. The resulting curve is reversible if the temperature variation is cycled 
and it has the peculiarity that the becomes constant below T^. In the second 
(zero-field cooling) the temperature is lowered to Ti < T^ without applying any 
field. Then, the field is activated and the sample is reheated across the critical 
temperature. Above Tc, Xdc^ coincides with the field-cooled version, but this is not 
the case in the frozen phase. There, the curve is no longer constant, but increasing 
with T. Furthermore, while the FC curve was reversible, this one is not and it 
also depends on the temperature-variation rate. We are seeing a system out of 
equilibrium. In fact, if, after zero-field-cooling the system, we were to switch on 
the field and let the system evolve at fixed temperature, the susceptibility would 
grow slowly, approaching the FC value but never reaching it, even for waiting times 
of several hours. 



3 Notice that this is different from the frequency-dependent, or ac, susceptibility we mentioned 
earlier. 
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Figure 9.2: Memory and rejuvenation effects in an experimental spin-glass (see text for 
explanation). Figure taken from [jon98| . 



A different, but related, example is the thermoremanent magnetisation. In this 
case, one cools the system in a field down to Ti < T^, then waits for a time iw, 
switches off the field and measures the decay of the remanent magnetisation when 
a time t has passed after switching off the field. It turns out that this magnetisation 
depends both on t and on the waiting time t^ that the system evolved in a field 
(the decay is slower for longer ty^). This phenomenon is called aging. For a rapid 
cooling, the remanent decays as a function of [(^ + ^w)/^w] jRODOSj , so is the 
only recognisable time scale. If ^ = 1 we say that we have full aging. 

Notice that the concept of aging also applies to the coarsening dynamics of fer- 
romagnetic systems, although there the process is much faster [sicoSj. However, let 
us stress that the differences between the dynamics of spin glasses and ferromag- 
nets are much more profound than a simple change of speed. As an example of 



this, let us consider the experiment in Figure 9.2 taken from ]jon98| 1. In this study, 
the authors took an experimental sample and measured the imaginary part x" of 
the ac susceptibility as the temperature was varied. The applied field had a low 
frequency co/2n = 0.04 Hz. In particular, they took the following three steps: 

1. Starting at some temperature above T^, they cool the system at a constant 
slow rate of 0.1 K/min and then reheat it at the same rate. This cooling 
rate (in terms of the logarithmic derivative of the temperature), is very small 
compared with the frequency, co, which is equivalent to considering a regime 
where t <^ ty^. The resulting x'^T') is seen to be essentially reversible and rep- 
resented by a thick black line. Notice the cusp at the transition temperature 
Tc ^ 15 K. 

2. Back at the maximum temperature, the system is cooled again at the same 
rate (curve with empty diamonds). Now, however, they stop the cooling at 
Ti < Tc and let the system age for a few hours. This produces a 'dip' in the 
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X curve. 

3. They resume the cooling and the susceptibility quickly returns to the refer- 
ence curve, as if it had not aged at all. This effect is often named rejuvenation. 



4. Now, the system is reheated at the same rate, without stopping at Ti (curve 
with black diamonds). Surprisingly, the aging dip in the susceptibility is re- 
produced. Therefore, the rejuvenation of the previous step was not complete: 
the system has retained some memory of its previous aging. 

No longer can we talk of a simple, temperature-independent spin-glass phase 
as in a ferromagnet. A picture of thermally activated barriers is also discarded, 
since the time spent at Ti does not contribute to aging at T2 < Ti. Instead, we 
have some sort of temperature chaos: the appropriate spin-glass phase at Ti looks 
completely random from the point of view of the system at T2. 

Still, even if the dynamical behaviour of spin glasses is very complex, there is 



some regularity. We shall see in Section 9.2.4 how it is possible to make experimen 



tally testable predictions for several non-equilibrium quantities. 



9.1.3 Spin glasses as models for general glassy behaviour 

Structural glasses are formed when the viscosity of a liquid grows so much that 
the latter loses its ability to flow. The system drops out of equilibrium and its 
microscopic structure seems frozen. However, there is no long-range order: a still 
photo of the molecular arrangement would show no great difference from that of a 
dense liquid. 

The above picture is, in qualitative terms, much the same as the one we painted 
at the beginning of this chapter, when introducing the spin-glass transition. This is 
so despite the fact that the physical causes for the two kinds of freezing processes 
are very different. 

This observation is, of course, the motivation for the name of 'spin glasses' but, 
more than that, it is also the reason for much of their physical relevance. Indeed, 
taking spin glasses as model systems for investigating general glassy behaviour has 
many advantages, both from an experimental and from a theoretical point of view 

• The nature of glass formation is still poorly understood, it has not yet been 
possible to relate it to a reproducible thermodynamical phase transition. On 
the other hand, the freezing of magnetic moments in spin glasses is known 
to be a phase transition, both in experiments jGUNgij l, where one observes a 
diverging non-linear susceptibility, and in theoretical models [falqq| IbalooI . 
Knowing that one is really below a proper critical temperature has many 
advantages from the point of view of analysis and interpretation of results. 

• In both classes of systems, the experimental determination of characteristic 
length scales is very difficult (the microscopic correlations are not accessible 
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and one has to use an indirect determination via some susceptibility). How- 
ever, spin-glass systems have two advantages: (i) The size ^ of the glassy 
domains is much larger in spin glasses, and therefore easier to access experi- 
mentally (compare, e.g., f JOH99] [bero4b| with jBER03] ). (11) In experiments on 
structural glasses, the measurements are specially delicate and taken with a 
variety of methods (in [bero^I they estimate the enthalpy fluctuations from 
measurements of the dielectric susceptibility). Spin-glass experiments, on the 
other hand, use SQUIDs, allowing for a far greater precision and convenience. 

The free-energy barriers in fragile glasses grow with a power law, while in 
spin glasses this growth is ^ log ^, as we shall see. 

Finally, spin glasses are easier to model theoretically (lattice models with sim- 
ple interactions), making their subsequent analytical and numerical investi- 
gation more straightforward (although by no means easy, as we shall see in 



Section 9.2). 



Still, even if we accept that spin glasses are more convenient model systems and 
that they have a qualitatively similar behaviour to structural glasses, we need some 
concrete, quantitative bridge between both systems, if only to see how far we can 
take the analogy. 

In this sense, the study of dynamical heterogeneities can provide a valuable 
common ground. This topic has received much experimental attention in the last 
two decades liEPiooi iweeooi ikegooi ivrooi ioukioi iBERiij] and, has, in fact, been iden- 
tified as the key to understanding the statistical mechanics of amorphous solids. 
Indeed, even though in structural glasses there is no long-range order, there are 
non-trivial spatiotemporal fluctuations, caused by the wide distribution of relax- 
ation rates across the system. A thorough understanding of these dynamical het- 
erogeneities can therefore give many clues on the nature of the glassy phase. 

This heterogenous dynamic behaviour has recently been studied numerically 
in spin-glass systems [|CAS02i |caso3[ |jauo7[ Iaroo8| . In this context, the dynamical 
heterogeneities can be characterised by a two-time correlation length ^(i, iw), al- 
though the time scales that have been traditionally reachable in simulations have 
not permitted the authors to see correlation lengths greater than a couple of lattice 
spactngs. At the same time, some recent experimental works suggest liouKioi that 
^ will soon be accessible experimentally. Therefore, there is every indication that 
work on dynamical heterogeneities in structural and spin glass systems is soon go- 
ing to converge in a unified approach. In the following chapters we shall dedicate 
much attention to ^, which will be the basis of a finite-time scaling paradigm for 
the study of non-equilibrium dynamics. 
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9.2 

The theoretical spin glass: the Edwards- Anderson model 

The complexity of the experimental spin-glass phase is both an incentive and a 
formidable challenge for theoretical physics. We need a system simple enough to 
allow for, at least, some level of analytical treatment yet complex enough to explain 
the rich physics observed in experiments. 

The first obstacle is the very definition of an order parameter that could signal 



the onset of the spin-glass transition. By definition of spin glass — Eq. 9.1 — we 
cannot use anything based on long-range order. Moreover, the temperature chaos 
property implies that the frozen state of the magnetic moments is going to depend 
on temperature. Under these conditions, Edwards and Anderson ||EpW75| proposed 
replacing the spatial correlations of spins at different sites with a time correlation 
at the same site. Considering a very large sample with N nodes, we hav^ 

I 

^ = }^]:^Il(^^i^)^^i^))f (94) 

In other words, they considered the overlap between the configuration at two dif- 
ferent times (in equilibrium). In the spin-glass phase, where the spins are frozen, 
this is non-zero. In particular, we would expect q = 1 at T = and q ^ when 
T — > Tc (as in a second-order transition). 

Now we can take the usual step of replacing the time average limf^.oo (• • with 
an ensemble average and write 

^=^E(s^)^- (9-5) 

Now that we know how to define a workable order parameter, we need a more 
tractable spin interaction than that defined by the RKKY theory. The Edwards- 
Anderson model is then defined in the simplest possible way: as a random mixture 
of positive and negative nearest-neighbour couplings, 

H = - ^ JxySxSy. (9-6) 

The quenched }xy are extracted from some probability distribution such that Jxy = 
0. The actual shape of the coupling distribution is not very important (universality), 
the most popular examples are Gaussian and bimodal (±/) couplings. 

9.2.1 The mean-field spin glass and the RSB picture 

The Edwards- Anderson (EA) model is simple enough to formulate, but its solution 
is quite a different matter. In the DATE, one considers the mean-field approxima- 



4 We go back to considering Ising spins, Sx = ±1. This does not constitute an unphysical simpli- 
fication (consider, for instance, an orthorhombic lattice where one direction is favoured). 
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tion, which turns out to be trivial, and attempts to extend the solution to finite 
dimensions using the perturbative renormalisation group (a difficult task, to be 
sure). In the case of the EA spin-glass, as we shall see, not even the mean-field case 
is easy. In the following we give an outline of the mean-field solution for the EA 
spin glass. Our aim is not to present the full derivation, which is well explained in 
several places (see, e.g., liDOTOii lDOMo61). Rather, we shall simply give the essential 
information to understand its testable (and exotic) predictions for the nature of the 
spin-glass phase. 

The mean-field version of the EA model is the Sherrington-Kirkpatrick |j5HE75| 
model 

H = -YjhjSiSj. (9.7) 

For the sake of simplicity, we have numbered the lattice sites, rather than denoting 
them by their position vector, as we have usually done. The interactions are now 
not only restricted to nearest-neighbours, but exist between every pair of sites (this 
can be seen as an infinite-dimensional version of the EA model). 

The couplings are extracted from the following conveniently normalised Gaus- 
sian distribution 

= n\/I-p(-y/?)' (9-8) 



'<7 



so we have 



k = 0' /? = ^- (9-9) 

The factor 1/N has been chosen so that the total energy at fixed fi is proportional 
to N, to make the energy density independent of N. 

Now, let us compute the Edwards- Anderson order parameter with the replica 



trick, which we introduced in Section 2.2 First, let us write this quantity in replica 



notation. We start with the basic definition (9.5 1, averaged over disorder 



AT ' 

t ^} 

rn-2 



Now we multiply numerator and denominator by Z" and follow the same steps 



as in Section 2.2 The numerator again depends on n replicas, two of which, {}i and 

V, 



, say) are for the two spins that appear explicitly in ( 9.io[ >. Therefore, 



After performing the average over the /, we have, in the notation of Section 2, 

= — y lim (sf sf) , (9.12) 
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Our main goal, then, is to compute the effective Hamiltonian that defines the (•)„. 
The replica partition function is (omitting irrelevant pre-exponential factors) 

Z„ = Zf = E / exp(^ £ Y.hjS^s'; - ^NE/?) (9-13) 

{s"} ^ a=\i<i i<i / 



The integration over the can be performed explicitly with (9.8 1 and gives 



a<b ^ i ' 



(9-14) 



Finally, one linearises the sum over the sites by introducing the so-called replica 
matrix Q^^, 



\a<h-' J {s«} L a<b a<b i 



By definition. 



Qah Qhai 



Qa 



0. 



After some algebra, one can represent (9.15 1 as 

Zn = j DQ,foe^«^Q«">, 
where the effective Lagrangian is 



a<b 



. s« V a<b J 



(9-15) 



(9.16) 



(9-17) 



(9.18) 



Notice that we have eliminated the summation over i and constructed a simpler 
single-site Lagrangian. Finally, the equilibrium values of Q^b are defined by the 
equation SCn/^Qab = and turn out to be 



Qal 



1 



E(s^ 



I In' 



a ^b, 



(9-19) 



where the average (•)„ is defined by the partition function in (9. 17 1. 

It is here that the computation must stop, unless we introduce some ansatz for 
the replica matrix Q^i,. Since the replicas are all equivalent, one could be tempted 
to use a symmetric form 

Qab = (1 - ^ab)(]- (9-20) 

Using this hypothesis, the computation can be taken to the end. In particular, after 
performing the n — ^ limit, one finds that the value of q is given by the saddle- 
point equation 

-■^ dz 



exp(-z^/2) tanh^(|6z^). 



(9.21) 
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(9-23) 
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(9-24) 



Figure 9.3: Sketch of the first two steps of replica symmetry breaking. 



This equation has no non-trivial solution above T = Tc = 1, while for T < Tc it 
has a single solution q^AiT), such that limi^.^o ^?EA(7^) = 1- Even more, according 
to ( |9.i9| >, ( |9.2o[ ) and ( |9.i2[ > we have 



qEA{T) 



1 

N 



riim 



N 



(9.22) 



which is just the Edwards- Anderson order parameter that we introduced earlier. 
Therefore, it would seem that the replica-symmetric solution reproduces just the 
behaviour that we want. 

There are severe problems, though. The most serious being that the entropy 
becomes negative at low temperatures and that the spin-glass phase becomes un- 
stable in the entire T < T^. region. 

Clearly, the replica symmetry must be broken somehow. The way to do this 
was explained by Parisi jPAR79Af IparSoI . We consider the n x n replica matrix 
Qab- Before breaking the replica symmetry, Q^^i, = {1 — Sa},)qo- We now divide 
the matrix into constant blocks of size mi x mi and set each diagonal block as 
a submatrix whose off-diagonal elements are all qi. All the elements in the off- 
diagonal blocks remain in their original value qo. We then take a second replica 
symmetry breaking (RSB) step and subdivide the diagonal blocks, introducing a 



new overlap q2- See Figure 9.3 for a sketch of these two first steps. This process is 
continued indefinitely. 

Notice that the replica equivalence property is conserved, because all rows have 
the same elements (in different order). Now the values of the matrix elements can 
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be described by the pdf 



P(^) 



1 



Yj^{Qab-q) 



(9-25) 



n{n — 1) 



[{n - mi)3{q - qo) + (mi - m2)5(q - qi) 
+ {m2-ni3)3{q - q2) + ...] 



n 



(9.26) 



n{n — 1) 



(9-27) 



Finally, we have to take the limit n — > 0, 

p{q) = miS(q - qo) + (m2 - mi)3(q - qi) + {m^ - m2)S{q - q2) + ■ ■ ■ (9-28) 

This limit has inverted the order of the m/, so now we have < nti < m2 < . . . < 1. 
In the limit of infinite RSB steps we obtain a continuous variation, so qj^ — ^ q{x), 
with X G [0,1]. 

In other words, the spin-glass order parameter is no longer a number, but a 
function. In particular. 



There are infinitely many states, whose overlap q' is < q' < q^A = ^(1)- It can be 
seen that the RSB method cures the problems of the symmetric ansatz and produces 
a stable spin-glass phase. In fact, it has been proved rigorously that the RSB scheme 
produces the correct free energy for the Sherrtngton-Kirkpatrick model |talo6 | | . 

Some of the main physical predictions of this mean-field solution are (we shall 
see more in the following sections) 

• Below the critical temperature, there are infinitely many states. This is rep- 
resented by a non-trivial probability distribution of the spin-glass order pa- 
rameter q, which can take any value in [—qEA/^jEA] with non-zero probability 
density. 

• The states are organised in a hierarchical structure, giving rise to an ultra- 
metric overlap space. This is best seen by representing the RSB process as a 
branching tree. At lowest order, qo, all the elements are equal and contained 
in one branch. As we break up the ensemble of states into more and more 
clusters we keep increasing the value of q. Then, to find the overlap qab of 
two replicas belonging to different branches we simply have to follow up the 
tree up to the encounter point. Clearly enough, we have an ultrametric space 
where qac ^ T^^T'^iqab/'^bc}- This hierarchical organisation might explain the 
temperature chaos effect observed in the experiments. 



dx 



(9-29) 



dq 



In the presence of an external magnetic field, the spin-glass phase is not de- 
stroyed. 
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Such is the mean-field solution to the EA model. In principle, one could try to 
extend this description to finite dimensions by means of a perturbative renormali- 
sation group. One would expect the D = 3 spin-glass phase to share the essential 
properties of the mean-field solution (infinity of states, etc.) but differ in some 
details. In this way, one obtains the so-called RSB picture of the spin-glass phase 
(see | |dom98[ Idomo61 |mez87[ imaroobI for some analytical results and more detailed 
physical predictions of this theory). 



9.2.2 The droplet picture 

The above described RSB picture is not generally accepted as a faithful description 
of the spin-glass phase in D = 3. The most popular alternative is the droplet 
theory ]BRA84t |BRA87t [mcm84[ Ifis86bI lFis88"Aj l . which stems from a Migdal-Kadanoff 



approximation to the EA model. 

The droplet approach paints a completely different picture of the spin-glass 
phase. Instead of having a non-trivial order parameter distribution, it considers 
only two equilibrium states, with q = it^EA- 

From a phenomenological point of view, the model assumes that the lowest- 
energy excitations of the system are compact domains, or droplets, of coherently 
flipped spins. In particular, the typical free energy of an excitation of size L scales 
with Ly , where y is the so-called stiffness exponent. Alternatively, zero-energy 
droplets of size L occur with probability L~y . It can be proved that 

y<(D-l)/2. (9.30) 

Notice that the stiffness exponent must be positive in order for there to be a sta- 
ble spin-glass phase. The issue has been examined numerically in D = 3, resulting 
in values of y ^ 0.27 licAR02i or y ^ 0.24 jBOE04}[BOE03| . In D = 2 the exponent is 
negative and the spin-glass phase disappears. 

One of the main consequences of this picture is that the spatial correlations 
decay with y, 

C{r^) = {srSjf - {sif{sjf - -J . (9.31) 

For this reason, q'^ — q^ — ^ 0, that is, the p{q) that we introduced in the previous 
section has zero variance. This is translated into the previously mentioned property 
of a trivial overlap, where p{q) = 5{q^ — q\p^- 

Another point of departure of the droplet and RSB pictures is the behaviour 
in an external magnetic field. The RSB predicts that the spin-glass phase can sur- 
vive a small magnetic field. However, in the droplet picture we can generalise the 
Imry-Ma argument of Chapter j/j The energy barrier for flipping the droplet is 
Ly — hL^^^. Since y < (D — l)/2, this makes the spin-glass phase unstable in the 
presence of even an infinitesimal h. 

Finally, let us mention that even if the droplet picture is simple compared to the 
RSB one, it still can explain the complex physics of the experimental spin glass. In 
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particular, the temperature chaos property is expected. Indeed, the droplet bound- 
ary scales as with D — 1 < Dg < D. Therefore in order to produce the U scale 
dependence, with y < Dg, different parts of the droplet's boundary must give al- 
ternating contributions. This gives rise to a delicate balance of energy and entropy 
that is easily upset with a small change in temperature. 

9.2.3 The geometry of the excitations and the TNT picture 

A final interesting issue concerns the geometry of the activated domains. In the 
droplet picture we have assumed these are compact, so their surface scales as 
where D — 1 < Dg < D, and their surface-to-volume ratio goes to zero as L ^ 00. 
Furthermore, there are no zero-energy excitations. 

In the RSB picture, however, we do have to consider excitations with zero energy 

cost. Let us denote by {s^^} the original spin configuration and by {s^x ^} the 
excited one. Then, we consider the link overlap (as opposed to the spin overlap q) 

Qunk = ^ E 4°'4'"4''4'^ (9.32) 

^ {x,y) 

where N\ is the number of links. This may seem a pointless definition from the 
mean-field point of view, since in the Sherrington-Kirkpatrick model Qiink = 
trivially, but we shall see how we can get valuable information from this observable. 
We note that in D = 3 we no longer have this simple relation, but the RSB picture 
still expects Qunk to have a non-trivial distribution (i.e., be some function of q'^). 

Let us now consider an excitation where we pass from a ground state to another 
by flipping 0{N) spins. The link overlap, unlike q, only changes for those links 
across the surface of the flipped domain. Its relative change is, therefore, of order 
2^Ds-D order to make this compatible with a non-trivial distribution for Q^nk 
it follows that for RSB systems Dg = D. In other words, the excitations are space- 
filling. 

The above considerations led some authors to consider the issue of zero-temperature 
excitations and their geometry, as a means of deciding between the droplet and 
RSB pictures. In the year 2000, Palassini and Young [ipaloo] and Krzakala and Mar- 
tin [iKRZOOi reached the following conclusions from their numerical simulations 

• There exist excitations involving 0{N) spins but having finite energy. That 
is, the spin overlap distribution is non-trivial (as in RSB). 

• The surface of these excitations is Dg < D, so the link overlap distribution is 
trivial (as in the droplet picture). In particular [jPALOoi IPAL03] , D — Dg ~ 0.44 
in D = 3. 

Therefore, this trivial-non-trivial (TNT) picture emerges as an intermediate alter- 
native to droplet and RSB. 
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9.2.4 Coarsening vs. non-coarsening dynamics 

Throughout this chapter, there has been a disconnection between experiments and 
theory in that the former were considered in a non-equilibrium regime and the 
latter was explained in terms of the equilibrium spin-glass phase. The usual justi- 
fication for this approach is that the equilibrium structures, though unreachable in 
finite time, do condition the dynamical evolution of the system. 

Still, the explicit theoretical study of the non-equilibrium evolution can be very 
rewarding. To this end, the first (and most important) step in the understanding 



of complex phenomena such as those of Figure 9.2 is surely understanding what 
happens during the isothermal aging. 

In this sense, one considers the direct quench experimental protocol liBER02t 
IJIM03} IMAROOAI |kis96[ |RiE93f |jiM03[ , |PGo6l ICAS021 |CAS03} iJAuo/] . We start by consid- 



ering a starting configuration at a very high temperature (i.e., completely random 
orientation of the spins) and instantaneously cool it down to the working temper- 
ature T < Tc- We then let the system age for some waiting time iw and probe its 
properties at a later measuring time t + t^. In these conditions, the main feature of 
the dynamics is the formation and slow growth of coherent domains, whose size 
is characterised by a coherence length ^(iw)- We remark that this coherence length 
is accessible to experiments, through estimates of Zeeman energies f JOH99], where 
its extremely slow grow results in ^ ^ 100 lattice spacings, even close to Tc and 
for several hours of evolution (the evolution grows ever slower as we decrease the 
temperature). 

This much is agreed by all, but the controversy soon begins when we consider 
the nature of these coherent domains. On the one hand, in the droplet picture the 
dynamics consists in the growth, or coarsening, of compact domains where the spin 
overlap takes one of its two possible equilibrium values, q = it^EA- In this sense, 
the droplet spin glass behaves as a disguised ferromagnet. The aging dynamics of 
all coarsening systems is qualitatively the same. 

In particular, if one measures times and lengths in units of ^(^w)/ no non- 
equilibrium observable should depend on the quenched randomness and the re- 
sulting scaling functions should be identical for all coarsening systems [fis88a[ . 
This so-called superuniversality property then treats in exactly the same way sys- 
tems as diverse as the ferromagnetic or site-diluted Ising model, or even the DAFF 
that we studied in Part |III] I arooSII . According to the droplet picture then, the 
isothermal aging of the spin glass would be no different]^ 

We are not aware of any investigation of the dynamical consequences of the 
TNT picture, but an antiferromagnetic analogy suggests that TNT systems will 
also show coarsening behaviour. 

The study of the dynamics in the mean-field limit was thoroughly undertaken 
in the 1990s jcuG93[ [cug94b] . The emerging picture for an RSB spin glass resulted 



5 We stress that we are talking of isothermal aging. When the temperature is varied, the behaviour 
of a droplet spin glass is much more complicated than that of a ferromagnet, due to temperature 
chaos. 
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Figure 9.4: Violation of the fluctuation-dissipation theorem (dotted-dashed line) in an ex- 
perimental spin glass (from liHER02L see text for discussion). In the figure, t' corresponds 
to our tw and t corresponds to our t + tw 



much more complex than that of coarsening systems. The main difference is that 
in RSB systems there exist equilibrium states with q = 0, so the dynamics remains 
forever in this sector. Furthermore, the replicon, a Goldstone model analogous to 
magnons in Heisenberg ferromagnets, is present for all T < | |dom98[ |domo6I| . 
As a consequence, the spin overlap is expected to vanish even inside each domain, 
in the limit of large ^(iw)- Finally, the spin overlap q is not a privileged observable: 
the link overlap exhibits the same aging behaviour (overlap equivalence). 

A good example of the relevance of these predictions for experimental work is 
the case of the mild violations of the classic fluctuation-dissipation theorem (FDT). 
This result, proven in jcAL3i| , |kub37| , makes it possible to predict the system's 
response to an external field from its equilibrium dynamics, but only if the sys- 
tem was already in equilibrium when the field was applied. Still, the pattern 
of violations of the FDT for spin glasses admits a very simple parameterisation. 
Cugliandolo and Kurchan jcuG94A} |cuG97i l proposed characterising the deviations 
from the FDT behaviour by studying a fluctuation-dissipation ratio, representing 
the response as a function of the system correlations. 

This procedure was followed experimentally in [|HER02il and the result is repre- 



sented in Figure 9.4 In it, the authors consider the response function at time t + t^ 



of a field that was switched off at time t^, denoted by ^(i, iw)- This is represented 
against the temporal autocorrelation of the system C(f, fw)j^ Should the systems 
obey FDT, the resulting plot x{C) should fall on a straight line (dotted-dashed line 
in the figure). In the spin glass, the linear behaviour is indeed reproduced for 
short measuring times t, but the curves eventually deviate from this prediction at 
a fw-dependent branching point. 

In the ty^ ^ 00 limit, the RSB and droplet pictures expect a different pattern 



^The experimental definition and measurement of these quantities is somewhat involved, 
see rfHER02j for the details. 
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of violations. In both theories, the measured curved should branch off from the 
FDT prediction at C = q^p^. However, in the RSB framework one expects that 
limf^^>ood;^/dC would correspond to the Parisi function x{q) that we introduced 
in Section 9.2.1 [fr a98[ [FRAggj . In the droplet picture, on the other hand, the curve 
of violations should be horizontal. In Figure [94 the authors have attempted this 
extrapolation (dashed line). 



9-3 

The Edwards-Anderson spin-glass in numerical simula- 
tions 

In order to be made quantitative, the different theories for the spin-glass phase need 
input from numerical simulations. In this sense one can take two complementary 
approaches 

• Simulate a very large system with a Monte Carlo update that reproduces the 
physical evolution (such as heat bath or Metropolis dynamics). As long as 
^(iw) ^ ^, one can consider to be observing the same thermodynamic al-limit 
behaviour as in a physical sample. Then, take a long-fw limit to predict the 
physical evolution. 

• Simulate smaller lattices with some kind of optimised dynamics (typically 
parallel tempering) to equilibrate the systems. Then perform a finite-size 
scaling study to perform a large-L extrapolation. Use the information thus 
obtained on the equilibrium spin-glass phase to predict the experimental non- 
equilibrium behaviour. 

We stress that the goal of numerical work on the Edwards-Anderson model 
is ultimately to understand the nature of the spin-glass phase. In this sense, the 
situation is complementary to that of the DAFR In the latter, the low-temperature 
phase is trivial (antiferromagnetic order), but the phase transition itself is poorly 
understood. In the spin glass, the phase transition has been thoroughly examined 
numerically |PAL99f ibalooi IhasoSaI lHASo 8Blj7| and it is the low-temperature phase, 
as far from the critical point as possible, that has the most interest. 

Of course, the extreme sluggishness of the dynamics poses a problem for nu- 
merical investigations. For instance, the typical time scales of experiments on spin 
glasses range from a few seconds to a few hours, but each MCS is equivalent to 
only 10~^^ s |jMYD93|. Therefore, one has to perform an enormous number of lat- 
tice updates even to begin to approach the physically relevant regime. In this sense, 
non-equilibrium simulations have reached only about 10^* s. In the equilibrium 

^Actually, this is true for the Ising spin glass that we consider here. For the version with vector 
spins (Heisenberg spin glass), the nature of the phase transition is still controversial (see, e.g., 

[ jFER09B}IVIE10] ). 
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case, the lattice sizes that one can thermalise are very small, so the data are always 
suspect of being dominated by finite-size effects and not considered representative 
of the real physics. 

On the other hand, the understanding of the spin-glass phase is not yet ripe 
for a tethered study, because the appropriate reaction coordinate (which must be 
sample-dependent) is not known. Still, some of the techniques developed in the 
previous chapters will prove to be useful. 



9.3.1 Introduction to our work with Janus 

In the following chapters we shall take a brute-force approach to the numerical 
study of the spin-glass phase. In particular, we have carried out our simulations 
using Janus, a special-purpose machine optimised for Monte Carlo simulations of 
spin systems. For this restricted class of problems, Janus can outperform conven- 
tional computing architectures by several orders of magnitude, thus providing a 
qualitative jump in the time and length scales that can be accessed. In particular, 
we have been able to follow the non-equilibrium dynamics up to times of 0.1 s, 
at the threshold of the experimental scale, and we have thermalised lattices up to 
L = 32 down to T = 0.64rc. 

The characteristics of Janus are very briefly explained in Appendix D The Ising 
spin-glass simulations carried out with Janus, whose parameters we report in Ap- 
pendix |E| have produced four physics publications so far: [' jANoSct | JAN09 a[ [JAn lOAt 
|janiob| . This work is the fruit of the collaboration of a large number of people, in- 
cluding physicists and engineers. To reflect this, the following chapters include 
only those physical studies where the author carried out a major fraction of the 
work. In particular, we leave out the results corresponding to Sections 5.2 and 6 
in jjANogA] , as well as those corresponding to Sections 7.2 and 7.3 in Hjanioa] . For- 
tunately, the quoted sections are relatively independent from the rest of the works, 
so our presentation will be self-contained. 

We note that the order of presentation of our results in this and the next chapter 
is markedly different from their original development. In our original work we 
were of course constrained by the running of the simulations (we carried out the 
non-equilibrium ones first, while our equilibrium simulation campaign took over 
a year). In this dissertation, we shall try to present the results in a logical fashion, 
rather than in a chronological one. 



CHAPTER X 



The statics-dynamics correspondence and the 
finite-time scaling paradigm 



In this chapter we tackle the second of our main topics: the relation between the 
(experimentally unreachable) equilibrium phase and the off-equilibrium dynamics 
in the context of spin glasses. 

After introducing the relevant physical observables and correlation functions, 
we try to characterise the isothermal aging dynamics of the D = 3 Edwards- 
Anderson spin-glass. We begin by using only non-equilibrium simulations and 
study issues like the growing characteristic lengths or the thermoremanent mag- 
netisations. 

Eventually, however, the non-equilibrium study faces an imposing stumbling 
block: in order fully to understand the reasons for the dynamical behaviour at finite 
time one must consider extrapolations to infinite time. In particular, we observe 
a very interesting crossover behaviour in the so-called dynamical heterogeneities 
(Section io.^\ that we will be unable to quantify. 

It is then that we introduce a complementary approach: that of finite-size equi- 
librium simulations. First, we promote the relation between the equilibrium phase 
and the dynamics to a quantitative level, through a time-length dictionary. This 
will allow us to introduce one of the most important concepts (and results) of this 
study: the notion that experimental non-equilibrium physics is equivalent to the 
equilibrium behaviour, not of the thermodynamical limit, but of a more modest 
lattice of L - 100. 

Finally, we resume the analysis of dynamical heterogeneities from an equilib- 
rium point of view. This allows us to perform a fully quantitative study and, 
through it, to take the statics-dynamics one step further: the finite-time scaling 
paradigm. 

Throughout this chapter, we shall focus only on those aspects of the spin-glass 
dynamics and equilibrium about which the different theoretical pictures are in 
full agreement. For instance, we do not try to decide whether the dynamics is 
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coarsening-like or not. We shall see that this neutral approach can actually take us 
very far. 

In Chapter ii we shall, on the other hand, address the RSB-droplet-TNT con- 
troversy directly. We shall then try to decide which of these is a better description 
of the spin-glass phase. 



10.1 

Physical observables for spin-glass dynamics 

We work on the Edwards- Anderson model introduced in the previous chapter. 



■H = - J] JxySxSy. §^ 

We use bimodal couplings, }xy = ±1 with 50% probability. For this Hamiltonian, 
the phase transition occurs at Tc = 1.109(10) I|haso8a[ |haso8b|| . 

We study the dynamics in the direct quench protocol explained in the previous 
chapter. We use heat-bath dynamics, which is in the universality class of physical 
evolution. We take 1 MCS to correspond to 10~^^ seconds | jMYD93] . When studying 
the non-equilibrium dynamics we shall consider that we are in the thermodynam- 
ical limit, even if we simulate in a finite lattice (L = 80 in our case). In practice, 
this approximation is good as long as the length scale of the coherent domains that 
grow during the evolution remains much smaller than the system size. We shall 



check this issue in Section [10.3.2 



We represent by 0(iw) the time evolution of some observable O for a single sam- 
ple, and by 0(fw) its disorder average. In principle one would have to average over 
many thermal histories. However, the main source of error is sample-to-sample 
fluctuation, so it is better to run more samples than more histories for each one. 
Still, we run two independent thermal histories for each sample (two real repli- 
cas), for reasons that will become apparent below. Whenever one observable can 
be averaged over the two replicas, we shall do so, even if we do not indicate it 
explicitly. 

The first issue we have to consider when defining physical observables is the 
gauge symmetry induced by the quenched average jTOU77] . Indeed, the Hamilto- 
nian is unchanged under the transformation 

Sx > OixSx, (10.1) 



Jxy > OixJxyl^y, (1O.2) 

where ctx = ±1. In principle, this would mean that one should have to consider for 
each observable its gauge average 2~^° 0({(XxSx})- Notice, however, that the 
gauge-transformed couplings are as probable as the original ones, so the disorder 
average of O is the same as its average over the gauge orbit. 
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We need, therefore, gauge invariant observables. The usual way out of this 
problem is introducing real replicas. These are defined as statistically independent 

systems {s^''*} that evolve with the same set of fxy Their overlap field is 



(10.3) 



obviously gauge-invariant. 

The averaged density of the overlap field defines the spin-glass order parameter, 
the spin overlap q 

1 



N 



(10.4) 



We can also consider the spin-glass susceptibility 



;^SG(iw) = Nq^{ty,), 



(10.5) 



which, as can be seen using fluctuation-dissipation relations, is essentially the same 
as the non-linear susceptibility — see (54]) and (5.23 1. In fact (in equilibrium) 



6X4 = ;tsG -2/3. 



(10.6) 



lo.i.i Temporal correlations 

Since we are working on an aging system, we need to consider two time scales, as 
we commented in Chapter [gj We are particularly interested in the overlap between 
the configurations at the waiting time and at the measuring time t + t^, 

Cxit,tw) = S^x\tw)s''x\t + tw). (10.7) 

Notice that this quantity needs only one replica. 

We can use the two-time overlap to define the temporal correlation function 



C(i,iw) = ^E^^(^'^w). (10.8) 



For fixed iw, C{t, t^) is strictly decreasing in t, indicating the gradual but very slow 
decorrelation in the system. In fact, for fixed iw, the C(i, iw) can be interpreted as 



the thermoremanent magnetisation (Section 10.4 1. This is because we can make 
a gauge transformation from the uniform configuration resulting from a strong 
magnetic field to the random initial configuration in our simulations. Therefore, 
the overlap with this random configuration plays the part of a magnetisation. 

In the pseudoequilibrium regime, t <^ ty^, the (real part of the) magnetic sus- 
ceptibility at frequency co = In I T is given by the fluctuation-dissipation formula 

(i-c(f,fw))/r. 
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For any finite t^, limt^oo C{t,tw) = 0. However, in equilibrium t^ — > oo, the 
overlap has a finite value below T^. In particular, if we define the stationary, or 
translationally invariant, part of the correlation, 

Coo(i) = lim C{t,tw), (10.9) 

we get 

flEA = lim Coo(0- (10.10) 
This equation defines the value of the Edwards- Anderson order paramete r]^ The 



dynamical computation of q-^A from this limit is notoriously difficult l|PGo6l [1N197I, 



In this 



10 



1. 



even with the extremely long times simulated on Janus (Section 10.6 ( 
chapter, we shall consider an alternative, equilibrium, approach (Section 

The introduction of the stationary part of C{t,tw) can be generalised to a divi- 
sion of this correlation function in several time sectors. We can define the }i sector 
by 

C^'Vs) = lim (sif ,tw). (10.11) 

The translationally invariant sector would thus correspond to ^ = 0. Notice that 

q-EA < C^\s) < 1. In order to cover the whole range of values C G [0,1], at least 
another sector must be considered. The simplest (and experimentally supported) 
one is the full-aging regime |jEDDD5|, obtained by setting ^ = 1. If full-aging holds, 

one would expect Coo (s) to cover the whole range from q^j^ to 0. We shall examine 
these claims in Section [10.2 



We finally note that some authors licAS02i ICAS03] use different definitions for 
these functions, so care must be exercised when consulting the literature. In par- 
ticular, often t is taken to be our t + t^ and the Coo{t) is defined so that it tends to 
zero (i.e., subtracting q^A from it). 

10.1.2 Spatial correlations 

The spin-glass dynamics is characterised by the growth of coherent domains both 
in the coarsening and the non-coarsening pictures (even if their shape and nature 
differ). These can be studied through the spatial autocorrelation of the overlap field 



I 

C4(r,iw) = ^Yjqx{tw)qx+r{tw) ■ (10.12) 



X 



In particular, the long-distance decay of C^ir, iw) defines the coherence length ^(i^ 



C4(r,^w) ^f{r/at^))- (10.13) 



^Recall that in a droplet setting this is the only equilibrium value of q, while in an RSB setting it 
is the maximum of the Parisi fimction 
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At the critical point, a is non-zero and related to the anomalous dimension, through 
fl(Tc) = D - 2 + 7/. In D = 3, a{Tc) = 0.625(10) IIhasoSaIIhasoSbH . 

Below Tc, the value of a characterises the dynamics. For fixed r/^(iw), the 
correlation function of a coarsening system does not tend to zero when r — > 00, 
since the domains are compact. In a system described by the RSB picture, how- 
ever, there exist equilibrium states with q = 0, so the dynamics remains forever 
with a vanishing order parameter. This is related to the existence of the replicon 
mode, a Goldstone boson (like magnons in Heisenberg ferromagnets or pions in 
high-energy physics). Therefore, a ^ for RSB systems. It has been conjectured 
that a{T < Tc) = fl(Tc)/2 |dom98[ . The exponent a is probably discontinuous at 



Tc | |rAR97] . 

In short, the value of a below the critical temperature is a good marker to dis- 
tinguish coarsening dynamics (droplet or TNT) from the RSB predictions. We shall 
consider this issue in Chapter 



11 



Notice that the presence of the damping function is required due to causality 
considerations. In the case of C4^{r,tw), the damping function / seems to decay a 
little faster than an exponential | jjiM03} iMAROoAi , 

/(x) - |6 ^ 1.5. (10.14) 



Notice that the spin-glass susceptibility ( 10.5 1 can be defined as 

Xscitw) = Nf(t^ = f d^r Ci(r, t^). (10.15) 



In order to characterise the aging dynamics, one must take dynamical hetero- 
geneities into account [iCAS02i[rAS03}|jOT07| . This is best accomplished through the 
two-time spatial correlation 



Cl+lir, t, tw) = [^^(^' iw)Cx+r{t, tw) - C2(i, t^)] (10.16) 

^ X 

Notice that this is a connected correlation function. In principle, it may seem more 

2 

natural to subtract C{t,tw) , but due to the self-averaging nature of C (see below), 
both definitions are equivalent in the thermodynamical limit. We note that this 
correlation is natural for the structural glasses problem, where the order parameter 
is not known (see, e.g., jBER05| and references therein). 

C2+2 seems complicated, but it actually has a simple probabilistic interpretation. 
Let us call a defect a site where Cx{t,tw) = —1 and let n{t,tw) be the density of 
such defects. Trivially, C{t, fw) = 1 — 2n{t, t^)- Then, the conditional probability of 
finding a defect at site x + r, knowing that there is one at x, is n{t, tw)g{r), where 
g{r) represents the pair-correlation function of the defects. Hence, C2+2(j'/^^w) is 
just 4n2(f,f^)(l-g(r)). 

The long-distance decay of €2+2(1', t,tw) defines the correlation length ^{t,tw), 
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the characteristic length scale for dynamic heterogeneitiesj^ 

C2+2{r,t,t^) > -^g{r{^{t,t^)) . (10.17) 

Little is known in the literature about either h of g, since, before Janus, the time 
scales that could be explored reached only very small values of ^{t, t^). 

Notice that we have called ^ a correlation length, while we said that ^ was a 
coherence length. The distinction stems from the fact that the former is computed 
from a connected correlation function and the latter from a non-connected one. In 
particular, ^(iw) diverges in the large-^w limit, while this may or may not be the 
case for ^. We must point out, though, that this nomenclature is not universal. 

In the RSB framework, we can distinguish two sectors in the dynamics: (i) 
relaxation within a single state, where q-^A < C < 1, and (11) exploration of new 
states, where C < r^EA- These can be identified by the relation of ^ and ^. In 
particular, for sector (i) we expect ^{t,tw) <^ ^{tw)- We shall find ^(f,fw)/^(^w) a 
valuable dynamical variable (Section lo.g^ . 



Throughout this chapter, and the next, we shall need to consider the evolu- 
tion of the physical observables along many orders of magnitude in time, which 
complicates their interpretation and analysis. Yet, recall that, for fixed t^, C(i, iw) 
defines a one-to-one relation between C and f . Hence, we shall often eliminate t as 
an independent variable in favour of C. 

A final note concerns the issue of self-averaging. Notwithstanding the discus- 



sion in Section 2.2.2 most of the observables considered here are actually self- 
averaging. This is because, in the non-equilibrium regime, the coherence length 
is much smaller than L, so we essentially have about iP liP independent blocks. 
The exception is X'X.i which is non-local. In fact, the central limit theorem suggests 
that the probability distribution of (?(fw) should tend to a Gaussian when L — 00. 
Therefore, the variance of a;sg(^w) is ^ 2;\;gQ(fw) in the limit of large systems. 



10.1.3 Our non-equilibrium simulations 

We comment the parameters of our non-equilibrium simulations, as well as some 
technical details, in Appendix |E} Let us summarise by saying that we have run 
simulations for three subcritical temperatures T = 0.6,0.7,0.8 as well as for T = 
1.1 ~ Tc. For the first three temperatures, we reached t^ = 10^^, simulating 96 
samples for T = 0.6,0.8 and 63 samples for T = 0.7. At the critical point we 
reached t^ = 4.2 x 10^ and simulated 32 samples. 

These simulations were first presented in [jano8c| . Later, in jjANogA] , we sim- 
ulated a new set of 768 samples for T = 0.7, which shall constitute our main 
working temperature. For these new simulations we reached only ty^ = 10^^, since 



^Notice that C2+2 is the difference of two correlated quantities, so its error is significantly reduced 
from what one would expect from error propagation. We take this into account, as always, with the 
jackknife method (Appendix IbI. 
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for longer times we had observed finite-size effects (see Section 10.3. 2[ >. When dis- 
cussing our results at T = 0.7, we shall find it sometimes convenient to use the 
longer 63-sample simulation, instead of the shorter but more precise one with 768 
samples. 



10.2 



Aging and full aging in C(t, tw) 



We start our study of the non-equilibrium dynamics by considering the two-time 
correlation function. In Figure 10.1 on the following page we have plotted C(i, iw) 
for T = 0.6 (our lowest temperature), for a range of across 11 orders of magni- 
tude (equivalent to physical times from picoseconds to one tenth of a second). 
We observe the qualitative claims made in the previous section, namely 

• For each t^, C(i, iw) is a strictly decreasing function of t. 

• limt^oo C{t,tw) = 0. 

• As we increase t^, we begin to see an enveloping curve. This is the Coo(i) that 



we shall study in Section 10.6 



In the following we try to quantify these remarks. We begin by attempting to pa- 
rameterise the aging behaviour. It has been claimed in experimental work |jROD03| 
that C{t, tw) is well described by a full-aging behaviour, that is 



(10.18) 



at least in the range 10 < < 10 . To check for this in a systematic way, we 
consider the functional form 

C(t, ty^ : 



-fl:(iw) 



(10.19) 



If full aging is correct, the parameters A(fw) and a:(fw) should be constant for large 
This has been checked before fijiMoJ], but with much smaller statistics. 



We present the results of fits to ( 10.19 1 as a function of for our three subcritical 
temperatures of T = 0.6, 0.7, 0.8 in Figure 10.2 For each t^, we consider the range 



tw ^ t < lOtw (we want to avoid the short-times behaviour). These fits, and their 
errors, have been computed with the jackknife method explained in Appendix |Bj 
In our simulated range, the exponent Oi{t^) is an increasing function of t^, but 
its growth slows down noticeably. Still, the behaviour at = 10^^ seems beyond 
reasonable extrapolation. 

Notice that the oi{tw) exhibits some suspicious undulations (especially for T = 
0.6). This is a visual effect caused by the data correlation, as we discuss in detail in 
Section IB.3.3.2I 



In Section 10.4 we shall use our fits to (10.19) to study the thermoremanent 
magnetisation. 
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■to 




Figure 10.1: Temporal correlation function C{t,t„), Eq. (10.8 1, for T = 0.6. We plot the 
curves as a function of t for fw = 2^', where i = 1,2, ... ,18 (fw grows from bottom to top). 
The errors are smaller than the width of the lines. 





0.7 



Figure 10.2: Parameters in a fit of C{t, tw) to (10.19 1 as functions of t„. For T = 0.7 we use 
our longer simulations with 63 samples (cf. Figure \b^ . The evolution seems to slow down 
for large t^, but we do not yet see the full aging behaviour, in which these parameters are 
constant. 
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10.3 

Spatial correlations and the coherence length 

In this section we consider the issue of growing length scales in the spin-glass 
dynamics. The computation of characteristic length scales is a recurring problem 
in statistical mechanics, as well as in lattice gauge theory. In previous chapters we 
solved it by considering the second-moment definition (5^. Notice that, in the 



thermodynamical limit, this second-moment estimator is equivalent to 



_(oo) / / d^r r^C^{r,t^) 
^2 = \ CAD r ( , \ ■ 



The denominator in this equation is ;\;sg(^w)/ Eq. (10.15), which does not self- 
average. Therefore, this method is problematic in non-equilibrium studies, where 
one considers very large systems but not many samples (in our case A/'sampies ~ 100 
for r = 0.6,0.8, Tc and A/'sampies = 768 for T = 0.7). 

On the face of this problem, one could think of replacing the integral estimators 
by fits to some functional form for Ci^{r,t^) featuring the ^(iw)- However, in the 
spin-glass case we do not know this explicit functional form — we have at best a 
reasonable ansatz as to its scaling, Eq. ( io.i3[ > — so this is not a viable alternative. In 



addition, we would face the ever present problem of statistical correlations, which 
complicate fits and their interpretation. 

Therefore, we must go back to the integral estimators. In this respect notice that 
the C/^{r,t^), for fixed r, is self-averaging. The lack of self-averaging in ^2 stems 
from the fact that the integration volume grows with L. For a fixed size, it is the 
dismal signal-to-noise ratio in the long-distance tails which spoils the precision. 
But this is precisely the problem that one faces when trying to compute the inte- 
grated autocorrelation time from a time series (Section A.2.i[ > and we can borrow 
its solution: a self-consistent cutoff. 



10.3. 1 Integral estimators of characteristic length scales 

We start our discussion by simplifying our definition of the spatial correlations to 
take into account only values of r along the axes. We define 

1 ° 

C4(r,iw) = EC4(m,,iw), (10.21) 

z=l 

where the denote each of the lattice unit vectors. That is, we take the values 
of the correlation for a distance r along the axes as representative of the whole 
spherical shell of radius r. We shall study the goodness of this approximation in 



Section 10.3.4 



174 



The statics-dynamics correspondence and finite-time scaling 



Now, for positive integer k, we consider the integral, 

rL/2 

Jfc(fw) = / drr^Ci(r,t^). 







(10.22) 



Notice that our bound of integration is L/2, and not L, due to the periodic bound- 
ary conditions. Since we are considering the regime ^(iw) <^ i-/ this makes no 
difference. In addition, the actual numerical computation of Ij^ uses an error- 
dependent cutoff: we only integrate up to the point r^utoff where the relative error 
in C4(r, iw) rises above some threshold. This is the same trick used in the compu- 



tation of the integrated autocorrelation time in Section A.2.1 We shall discuss this 
issue in more detail in Section 



10.3.3 



With this notation, and assuming rotational invariance, the second-moment es- 
timate is 




(10.23) 



An alternative approach, considered in |kis96| , would be to identify ^(iw) with 
h{tw), which has the correct dimensions. However, this is only a good definition if 
a = and we want a definition that would work for any scaling behaviour of the 
type ( io.i3[ ). To this end, we note first that 



h t. 



t. 



Therefore, we can introduce the following definition 



hit. 



oc 



(10.24) 



(10.25) 



Notice that the coherence length is only defined up to a constant scale factor, so 
any of the ^fc,)c+l(^w) would work. 



Still, there is a systematic error involved. Equation (10.13) is valid only in the 



regime where 1 ^ r ^ L. The resulting bias is minimised by considering large 
values of k, which suppress short distances in the integrals. However, too large 
a value of k would give too much weight to the tails of the correlation function, 
where the signal-to-noise ratio is worst. A compromise in the value of k is clearly 



needed (cf. Figure 10.5 1. 

In Section 10.3.3 shall compare different estimators of the coherence length 
and demonstrate that, indeed, the integral ones produce a significant error reduc- 
tion. Furthermore, the issue of the choice of k is not as critical as it may seem. For 
the moment, let us proceed with the physical discussion, using as our preferred 
estimator ^1,2 (^w) • 



10.3.2 The dynamical critical exponent and finite-size effects 



We consider in Figure 10.3 our coherence length ^1,2 (^w) for T = Tc and for our 
three subcritical temperatures. For T = 0.7 we have plotted our simulations with 
63 samples, since they reach longer times. 



10-3 — Spatial correlations and the coherence length 



^75 




Figure 10.3: Growth of the coherence length for several subcritical temperatures and for 
the critical point. For T = 0.7 we use our longer simulations with 63 samples. 

As expected, the growth of ^(^w) is slower at lower temperatures. Furthermore, 
our curves follow a power law for a wide time range. There are some deviations at 
short times, due to lattice discretisation effects (our definitions do not make much 
sense for ^ < 3). Some of the curves also present some deviations for long times 
(this is most noticeable for T = 0.8). 

These can be interpreted as finite-size effects, even though we are simulating 
very large lattices (L = 80). Indeed, when studying non-equilibrium dynamics 
we assume that our system is in the thermodynamical limit, which in principle 
requires ^(iw) ^ ^- In practice, the bound depends on the numerical accuracy. 
From finite-size scaling we would expect the largest t^ for which L = 80 still 
represents L = 00 physics to scale as L > A:^i,2(^w)- We have estimated k by 
computing ^i2(^w) in simulations for L = 24,40 and noting where they diverge 



from their L = 80 counterparts (Figure 10.4). We conclude that at T = 0.8 the safe 



range is L > 7^i^2(^w)- For Tc, the equivalent bound is L > 6^i^2(^w)- 

In order to be safe, we have considered that ^1,2 (^w) is physically meaningful 
for 3 < ^(iw) < 10. For T = 0.6 the upper bound is not reached, while for T = 0.7 
it corresponds to t^ ^ 10^*^ and for T = 0.8 it corresponds to t^ ^ 10^. In this 
range, we have fitted the coherence length to a power law, in order to determine 
the dynamical critical exponent z, 

^(fw) = A{T)ti;^^^^ . (10.26) 

The results of these fits can be seen in Table lio.il We also include the maximum 
waiting time safe from finite-size effects, defined as ^1,2 (^w^'') = 10. 

The resulting values of z are very large, going from z(Tc) = 6.86(16) at the 
critical point to z{T = 0.6) = 14.06(25) at our lowest temperature. Also, we find 
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Figure 10.4: Finite size effects in ^{tw), obtained by computing the difference between 
?i,2(fw) for each of our lattice sizes and a power-law extrapolation. In particular, we define 
^Tii^vv) as the result of applying the fit to (10.261, computed for L = 80 in the range 
^ e [3, 10], to the whole temporal range. Then, for each lattice size we consider the relative 

difference (^w) - ^£Vw)]/?~2(iw). 



T 


-^samples 


[^min/ ^max] 


z 


^i/d.o.f. 


J. max 


0.6 


96 


[3,10] 


14.06(25) 


41.7/82 


9 X 10" 


0.7 


63 


[3,10] 
[3.5, 10] 


11.84(22) 
12.03(27) 


82.7/81 
52.7/71 


^ 2.2 X 10^° 


0.8 


96 


[3,10] 


9.42(15) 


17.1/63 


^ 4.3 X 10^ 


1.1 


32 


[3,10] 


6.86(16) 


18.7/46 


^ 3.5 X 10^ 


0.7 


768 


[3,10] 
[3.5, 10] 

[4,10] 
[4.5,10] 


11.45(10) 
11.56(13) 
11.64(15) 
11.75(20) 


86.9/76 
46.6/66 
40.1/58 
29.6/50 


^ 2.2 X 10^° 



Table 10.1: Value of the dynamic critical exponent z for several temperatures. For our 
simulations with less than 100 samples we have used the fitting range ^1,2 S [3,10], while 
for our 768-sample simulation at T = 0.7 we have needed to restrict the fitting range. For 
?i,2(^w) > 10, our simulations are affected by finite-size effects. We also report the value 
of i^^" for which this bound is reached (for T = 0.6 we never reach ^1^2 = 10, so this is an 
extrapolation). 
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Figure 10.5: Plot of the spatial correlation function Ci{r, t^) for several subcritical tempera- 
tures and = 2^°. We show the correlation multiplied by (left) and r"* (right), needed to 
compute hitw) and h{tw), respectively. The signal-to-noise ratio is the same in both cases, 
but for the latter the distances with maximum weight are closer to the noise-dominated 
region. The curve for T = 0.7 is the average over 768 samples, while those for T = 0.6,0.8 
were computed with 96 samples. 



that this exponent roughly follows the law 

z(T) = z(rc)^, (10.27) 

consistent with previous numerical and experimental studies [ jjongg^ ImarooaI . 

Furthermore, if we extrapolate the coherence length to a typical experimental 
scale of 100 s (^w ^ 10^*), we find 

(10.28) 
(10.29) 
(10.30) 
(10.31) 

which nicely compare with experiments jjongg^ [BER04B] . 

10.3.3 Comparison of different estimators of the coherence length 

In this section we consider some details on the implementation of our integral 
estimators and present a comparison of the different methods to compute ^(iw)- 
A naive numerical implementation of ( io.22[ > to compute ^1,2 already yields a 



^1,2 






0.6) 


= 14.0(3), 


^1,2 


^^w 


= iqI^, t = 


0.7) 


= 21.7(4), 


^1,2 


^tw 


= T = 


0.8) 


= 37.0(14), 


^1,2 




= 1014, T = 


Tc) 


= 119(9), 



considerable increase in precision with respect to ^2- However, we can still improve 



the computation. In Figure 10.5 we show how r'^C/^{r, t^) is very well behaved up to 
the point where it becomes compatible with zero, where the errors increase dramat- 
ically, spoiling the precision of the whole integral. We take this into account with a 
self-consistent integration cutoff, much like what we describe for the evaluation of 
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Figure 10.6: Left: Result of computing ^1,2 (fw) in two different ways for our 96 samples 
at T = 0.6. In the red curve we stop the integration at the cutoff point where the relative 
error in C4(r, fw) is greater than 1/3. In the blue curve we add the contribution of the tail 
from that point on, estimated with an extrapolation using a fit to ( 10.14I 1. The difference is 
small, but the second method maintains the power law behaviour until longer times. Right: 
Analogous plot for our 768 samples at T = 0.7. Now the effect is much less noticeable, 
since the increase in statistics has shifted the cutoff point. 



integral correlation times in Appendix A We integrate our data (interpolated with 
a cubic spline, although this choice is arbitrary) only up to the point where the 
relative error of C4(r, iw) becomes greater than 1/3. Thus, we reduce the statistical 
error at the cost of introducing a, hopefully small, cutoff bias. We can minimise 
this systematic effect by estimating the contribution of the tails with a fit to 



C4(r, t^ 



A 



1.5 



(10.32) 



This is the scaling function (10.14), using a = 0.4. The fit is computed for 
3 < r < min{15,rcutoff}/ where the signal is still good. This correction turns out to 
be only relevant for the largest times and this only for our simulations with ^ 100 
samples. For T = 0.7, where we have 768 samples, the cutoff distance is increased 
and the effect of the tail correction is hardly noticeable (Figure 10.6 1. 

We have claimed that this method yields a considerable increase in precision 
with respect to the computation of ^li^w), or with the evaluation of ^ from a fit. 
We offer two demonstrations of this. First, in Figure 10.7 we compare 47rJ2(iw) with 
Nq^it^), two quantities that should coincide (assuming rotational in variance). As 
we can see, the former is much more precise for the whole span of our simulation. 
As a second check, we have plotted in Figure 10.8 the coherence length for T = 0.7 
(where we have the best statistics) for several methods. All determinations are 
proportional, but the integral estimators are much more precise. 

As a final test, we check not only our determination of ^, but also its error 
estimate by comparing our 63-sample computation of |jano8c| with the improved 
768-sample one of I jjANogA] for T = 0.7. Figure 10.9 shows the difference in both 
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Figure 10.7: Computation of the spin-glass susceptibility for our 768 samples at T = 0.7 
using both its straightforward definition { w.^\ and the integral hitw) of ( io.22[ |. ;i(;sg(^w) = 
Nq^{tvf) = 471/2 (^w)- The second determination has been carried out using a self-consistent 
cutoff, which noticeably increases the precision. The inset details the upper-right corner of 
long times. 




Figure 10.8: Comparison of several methods to estimate the coherence length ^(^w) for 
our 768 samples at T = 0.7. We include the integral estimators ^1^2 arid ^04 of ([10.23 1, the 
second-moment estimate ^2 of ( io.2o| and the result of a fit to (10.13 1 using ( 10.14I and 
a = 0.4. All curves are parallel for large t„, but the integral estimators are much more 
precise. 
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Figure 10.9: Difference between the coherence length ^i,2(fw) computed with the 63 sam- 
ples of [ |jANo8c| and with the 768 of [ jjANogA) (the errors are the quadratic sum of those for 
each simulation). The curves are compatible in the whole time range. Mind the dramatic 
statistical correlation in the sign of this difference. 
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Figure 10.10: Left: Ratio of the errors in ^i,2(fw) for the simulations at T = 0.7 with the 63 
samples of [ |jano8c| and those of the 768 samples of [ jjANogAl (see text for discussion). The 
horizontal line is \/768 / 63 ~ 3.46. Right: Cutoff of the Ij^ integrals as a function of time for 
both simulations. 
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Figure 10.11: Level curves C4(r, fw) = c for c = 0.3 (dashed lines) and c = 0.1 (solid 
lines) at T = 0.6 (dotted lines are circles, for visual reference). The plot is restricted to the 
z = plane, for clarity. The innermost curve corresponds in both cases to fw = 4 and the 
succeeding ones correspond to geometrically growing times (fw = 4 x 16'). As we can see, 
the deviations from isotropy are mainly due to lattice discretisation (i.e., functions of r). 
The errors are smaller than the thickness of the lines. 



determinations (the error in the difference is the quadratic sum of the individual 
errors, since both sets of samples are disjoint). We have excellent agreement for the 
whole time range. Notice that the points in this curve are correlated in time, so the 
fluctuations of neighbouring points are not independent. 



Finally, we consider the error estimates in Figure 10.10 The expected error 
reduction is \/768/ 63 3.5, but we do not reach this value for most of our times. 



Notice that the relative statistical error in our error estimate is ^ 1 / y 2A/'sampies/ 

so the effect is not a fluctuation. The explanation is that the cutoff distance has 
increased for the simulation with 768 samples, which trades statistical error for a 
reduction of systematic biases. 



10.3.4 The isotropy of C4(r, t^) 

Throughout this section we have glossed over the issue of rotational invariance, 
assuming it is a good approximation. At all times we worked with correlations 
C4(r, iw) restricted to the directions along the axes, hence ignoring most of the 
N points for a given The main motivation for this approach has traditionally 
been to avoid the computation of the whole C4(r, fw)/ a task of 0{N^) in a naive 
implementation. However, as detailed in Appendix |A| we can turn the evalua- 
tion of an autocorrelation into the computation of two real Fourier transforms (the 
Wiener-Khinchin theorem). This latter task is 0(NlogN) with the FFT algorithm. 
Therefore, computing the whole C4(r, iw) implies no increase in numerical effort. 
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Figure 10.12: The coherence length computed with the whole correlation functions, 2 ' 
and with correlations along the axes, ^12. Both estimates coincide for large times and their 
errors are similar for the whole range. Data for T = 0.6. 



We examine in this section whether this complete correlation function is iso- 
tropic and whether we can take advantage of all the N point to reduce statistical 
errors in the determination of ^(iw)- The first question is answered in Figure 



10.11 



where we plot level curves C4(r, t^) = c for several values of c and iw As we can 
see, isotropy is recovered at quite small distances (remember we are only interested 
in ^ > 3). 

We next consider the possible improvement of our determination of ^(^w) with 
the three-dimensional correlation function. In order to use our integral method we 
must first average over spherical shells, 

„ , ^ N E|r|6[M,w+l) kl^Q(''/^w) 

Qk{n,tw) = ' z . (10.33) 

2^\r\e[n,n+l) ^ 

Notice that Qfc(0,^w) = r^Ci{0,t^) = 4o- 

We can use Qk{m, t^,) as we did r^C4(r, t^) 
in our integral estimators. We would expect the resulting coherence length (t^) 
to coincide with ^i,2(iw) for large iw However, if the fluctuations of C4(r, iw) are 
independent, will have smaller errors, due to the increase in statistics. In fact. 



see Figure [10.12 the statistical correlation between different parts of the lattice is 
so great that it renders the gain in precision negligible. 

In short, the usual approximation of considering only correlation along the axes 
introduces neither a bias nor an increase in statistical errors. Still, the compu- 
tation of C4(r, iw) could be rewarding if one needed to consider 4 with k > 2 



(Figure io.ij\ . 
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Figure 10.13: We compare the axes correlation r^Cj^{r,tvj) with the shell-averaged correla- 
tion Qiifftyi). Both functions have similar precision for small distances, but the second is 
better behaved in the tails. 



10.4 



The thermoremanent magnetisation 

The thermoremanent magnetisation of a spin glass is one of the easiest quantities 
to study experimentally. Indeed, it has been known since the 1980s that it decays 
a power law (see | gra87[Ipre88| and references therein), even for temperatures 
very close to the critical point, 3 

As we explained in Section 10.1.1 the two-time correlation C{t, t^) can be iden- 
tified with the thermoremanent magnetisation (we have to choose t^ 1 as our 
initial 'magnetised' configuration and consider its overlap with the configuration 
at time t ^ t^). Following [pARg/] we have fitted our data to the decay law 



C{t,t^ 



Ao{t^) + A^(t^)t'^'- 



(10.34) 



Notice that for a finite number of samples the correlation function does not actually 
go to zero for long times, hence the need for the finite asymptote Ao(iw)- This is 
actually also a problem for experimental studies | |gra87[ . 
Table 



10.2 



shows the results of fits to (10.34 1 our subcritical temperatures. 
The fits where computed in the range 10^ < t < t^^^, where t^^^ was given in 
Table 10.1 The lower bound of the fitting region can be varied along several orders 
of magnitude with no effect. Notice that the values of ;\;^/d.o.f. are always much 
smaller than one, due to the correlations. 



^The only deviations from this simple behaviour were observed for T > 0.98Tc. 



i84 



The statics-dynamics correspondence and finite-time scaling 



T 




C(fw) 


Ao(fw) X 103 


xl/d.o.f. 




0.6 ^ 0.55Tc 


2 
4 
8 
16 


-0.1525(23) 
-0.1495(22) 
-0.1459(20) 
-0.1430(19) 


2.6(6) 
2.8(8) 
2.5(10) 
2.4(12) 


14.7/64 
15.5/64 
17.4/64 
17.5/64 


2.14(5) 
2.10(5) 
2.05(4) 
2.01(4) 


0.7 ^ OMTc 


2 

4 
8 
16 


-0.1787(14) 
—0.1/65(13) 
-0.1733(12) 
-0.1704(12) 


1.47(25) 
1.8(3) 
1.7(4) 
1.6(5) 


23.3/50 
18.4/50 
18.9/50 
15.4/50 


2.067(27) 
z.041(z6) 
2.004(25) 
1.971(25) 


0.8 ^ O.ZSTc 


2 
4 
8 
16 


-0.210(8) 
-0.212(7) 
-0.208(7) 
-0.205(6) 


1.7(10) 
2.8(12) 
3.0(14) 
3.0(18) 


13.9/32 
11.1/32 
10.8/32 
8.43/32 


1.98(9) 
2.00(8) 
1.96(8) 
1.93(7) 



Table 10.2: Result of fitting the thermoremanent magnetisation to ( 10.34 '/ for three 
subcritical temperatures. We give the temperatures in terms of T^, to facilitate comparison 
with the experimental results of ( 10.33I . 
the diagonal chi-square per d.o.f. 



For each fit we give the two parameters and 
The last column shows the result of computing d 
cz, in order to consider the scaling of equation 
(T) (cf. Table [mT 



t < t 
temperature. 



10.371. The fits are computed for 10 < 
which accounts for the different number of d.o.f. at each 



The asymptote Ao(iw) is in all cases of about 10""^, small compared to the the 
smallest value of C(f, fw) that we reach for our finite t, of about 10~^. The decay ex- 
ponent is very small, T-dependent and exhibits a slight, but systematic dependence 
on (a tendency that was already observed in ||ms96|). 

The experimental values for c are, from |gra87|;^ 



c{0.55Tc 



-0.12, c(0.67rc 



-0.14, c(0.75rc 



-0.17, (10.35) 



These values are slightly higher than our results of Table io.2[ The reason may be 
that our values of c have been computed in fits where t and t^ differ by as many 
as 10 orders of magnitude, while in experimental work t/t^^ < 10^. 

We can recall here our study of full aging in Section k 
a fit in a narrower window of t^ < t < lOiw We can extrapolate our a(iw) to an 
experimental time of 100 s with a quadratic fit, such as we showed in Figure 
We thus obtain 



where we considered 



B.2 



-l/oi{t^ = 100 s) ^ -0.11, 
-l/a{t^ = 100 s) ^ -0.12, 
-l/a(iw = 100 s) ^ -0.14, 



r = 0.6 O.SSTc, 
r = 0.8 ^ 0.73Tc. 



(10.36a) 
(10.36b) 
(10.36c) 



4The errors are small, about the size of the plotted 



data points in Figure 3b 



of [ |gra87| . 



10-4 — The thermoremanent magnetisation 185 



These values still do not match the experimental values of (10.35 1. However, the 
difference between both sets seems to be roughly independent of temperature (this 
is best seen by plotting the parabolas defined by each of the sets of exponents). We 
believe this is due to an extrapolation error, with a similar bias for all temperatures. 



Recall now that the coherence length ^(^w) was also well represented by a power 
law. It follows that C{t,tw) should be a power oi ^{t + tw), at least for the small 
of Table 



10.2 



In particular we write 



(10.37) 



We could perform a fit to ( io.37[ > to find d, but we would have to contend with 
errors in both the x and y coordinates (see Section B.3.2). Rather, we combine 



equations (10.26 1 and (10.34 1 to find 



d 



cz. 



(10.38) 



Since both for C{t, tw) and for ^1,2 (^w) we are computing one fit for each jackknife 
block, we can apply this relation block by block to obtain d and its error estimate, 
without fitting for it directly. The result can be seen in Table 10.2 We obtain d ~ 2, 
with a temperature dependence at the limit of significance. 

This is a potentially useful observation for experimental work since, as we said 
before, the thermoremanent magnetisation is relatively easy to measure (compared 



with the coherence length, in any case fjongg]). Therefore, Eq. (10.37 1 could poten- 
tially be used as a definition of ^(fw) for experimental work. 

As we have seen, there are several systematic effects in our results, mainly the 
incompatibility of (10.35 1 ^^id (10.36) and the need for a finite asymptote Ao(iw) 
in ( io.34[ >. This suggests that (1034^ is perhaps not the best parameterisation. We 
can consider the alternative description 



C(t, t^ 



B(iw)exp[e(iw)(logf)^^'"^]. 



(10.39) 



1. Notice the difference of this 



This would reproduce a power law if /(iw) 
functional form with a stretched exponential, discarded in IGRA87I for our tem- 
perature range. The results of fits to ( 10.39! for several and fitting ranges can 
be seen in Table 10.3 We observe that the fit parameters are very stable to vari- 



ations in the fitting window. The value of /(^w) is incompatible with 1, a t least 
for T = 0.6,0.7. However, from the point of view of the ;\;^/d.o.f., both ( 10.34! ) 



and ( io.39[ ) are equally good. 



10.2 



We see that the exponent c{t^ 



Finally, let us go back to (10.34 1 arid Table 
is roughly linear in T. This suggests that the thermoremanent magnetisation could 
be a temperature-independent function of riog(i). We have tested this conjecture 



in Figure 10.14 and found it to be only approximate. 
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T 










2 


10^ 
10^ 


-0.236(7) 
-0.30(6) 


0.873(9) 
0.82(5) 


52.2/104 
13.9/64 


0.6 


10^ 
10^ 


-0.203(6) 
-0.25(4) 


0.909(8) 
0.85(4) 


47.9/104 
13.5/64 


8 


10^ 
10^ 


-0.176(4) 
-0.21(3) 


0.943(7) 
0.90(4) 


41.5/104 
13.9/64 


16 


10^ 
10^ 


-0.158(4) 
-0.19(3) 


0.968(7) 
0.92(4) 


38.1/104 
15.3/64 


2 


10^ 
10^ 


-0.263(4) 
-0.32(3) 


0.890(4) 
0.84(3) 


43.0/90 
14.4/50 


4 

0.7 


10^ 
10^ 


-0.230(3) 
-0.29(3) 


0.921(4) 
0.862(25) 


71.9/90 
12.8/50 


8 


10^ 
10^ 


-0.2003(23) 
-0.253(23) 


0.955(3) 
0.895(23) 


94.1/90 
13.0/50 


16 


10^ 
10^ 


-0.1768(20) 
-0.226(19) 


0.985(3) 
0.921(22) 


138/90 
10.6/50 


2 


10^ 
10^ 


-0.302(16) 
-0.5(3) 


0.891(16) 
0.77(15) 


45.1/72 
14.3/32 


4 

0.8 


10^ 
10^ 


-0.257(12) 
-0.6(4) 


0.934(14) 
0.71(17) 


63.6/72 
11.4/32 


8 


10^ 
10^ 


-0.223(10) 
-0.49(24) 


0.970(13) 
0.76(12) 


69.8/72 
11.1/32 


16 


10^ 
10^ 


-0.192(8) 
-0.40(19) 


1.008(12) 
0.81(12) 


65.9/72 
8.49/32 



Table 10.3: Parameters of a fit to Eq. (10.391, offering an alternative description of the 
thermoremanent magnetisation. For each temperature, we pre sent f its in the range fmin < 



t < t^^'^iT), where t„ 



10^,10^ and ^^'"'(T) is given in Table 10.1 
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Figure 10.14: Coherence length ^1,2 (^w) as a function of Tlogiw/ for three subcritical tem- 
peratures. Even if the three curves are not equal within errors, the overall scaling is sug- 
gestive. 



10.5 

Dynamical heterogeneities 

Thus far, we have characterised the spin-glass dynamics globally through the two- 
time correlation function. We have found this dynamics to be extremely slow and 
we have used a growing coherence length to monitor the evolution. 

However, the dynamics is actually heterogeneous across the system, with local 



regions behaving differently from the bulk. As we said in Section 9.1.3 the study 
of these dynamical heterogeneities is a potential point of contact between studies 
of spin glasses and of general glassy behaviour. 

The traditional numerical approach to dynamic heterogeneities has been the 
computation of coarse-grained correlation functions. These are defined as an aver- 
age over a cell of size with I <^ L. The characteristic length scale for dynamical 
heterogeneities can then be assessed through the probability distribution of these 
coarse-grained correlations. For £ much larger than the correlation length, one 
would expect the fluctuations to be averaged out, and the resulting coarse-grained 
correlation would resemble the global one. For small £, however, strong deviations 
from a Gaussian behaviour are present jjAuo/i ]. One defines, then, the correlation 
length as the crossover length. 

Here we follow a different approach, consisting in the study of the two-time 



spatial correlation C2+2(''/^^w) of Eq. (10.161. This observable was introduced 



in [ JAU07] , but the time scales accessible to that work did not permit the mea- 
surement of correlation lengths greater than a couple of lattice spacings. 
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Figure 10.15: The two-time spatial correlation function C2+2{^, i, i 
function of r, for several values of t. We consider a long waiting time of t 
63-sample simulations at T = 0.7. 



defined in (10.16 1 as a 
2'^^, for our 



We have plotted the correlation length C2+2{^,t,tw) (we use the same conven- 



tion for axially oriented correlations as for C4) in Figure 10.15 for a large waiting 
time. From this plot we can see, qualitatively, that the correlation reaches several 
lattice spacings for large times. 

Recalling the one-to-one nature of C{t,tw) as a function of t for fixed iw, we 
eliminate t as independent variable in favour of C^, using a cubic spline as our 
interpolating method. We consider, then, a correlation length ^C^, iw) from the 
long-distance behaviour of C2+2{^> t{^w> C'^), t^), in complete analogy with the case 
of ^(^w) and C4(r, tw), recall Eq. (10.17I. This correlation length is computed with 
the methods of Section 10.3.1 

We have plotted ^{C^t w) as a function of for several t^ at T — 0.7,0.8 in 
Figure 10.16 We observe that the evolution in t^ distinguishes two regimes. For 
large C the correlation length approaches a fw-independent enveloping curve. This 
corresponds with the low-i regime, which quickly approaches the translationally 
invariant sector of the dynamics. When drops below q^j^, however, we are in an 
aging regime, where the correlation length diverges in the large-iw limit. 

Since one would expect the divergence in the aging limit to behave as ^{tw), it 
is useful to consider the scaling variable 



R{c\t, 



ac\t. 



(10.40) 
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Figure 10.16: Correlation ler\gth ^{C^, t^) as a function of for several waiting times. We 
plot our results for T = 0.7 with 63 samples and for T = 0.8 (96 samples). 



r 


c2 


2^ 


4/d.o.f. 


z 


0.6 


0.200 


13.4(6) 


0.01/2 


14.06(25) 


0.7 


0.200 


11.14(20) 


0.69/3 


11.56(13) 


0.8 


0.100 


9.56(17) 


3.73/5 


9.42(15) 



Table 10.4: Value of the dynamic exponent of Eq. ( 10.43 ' for three subcritical tem- 
peratures. We also give our best determination of the dynamic critical exponent z from 
Table 



10.1 



We expect 

lim R{C^ < qlj^) > 0. (10.41) 



lim R{C^ > ql^) = 0. (10.42) 



We show this quantity for our three subcritical temperatures in Figure 10.17 For 
r = 0.7 we show both our longer simulations with 63 samples and our more precise 
set with 768. 

Clearly, there is a crossover, which is best observed for T = 0.6. However, we 
face a dilemma: for low T the growth of the coherence length is very slow and 
we do not see the large-C^ behaviour as clearly, while for high T the value of q^A 
is too low for us to appreciate the crossover at low (notice that the spin glass 
undergoes a second-order transition, so qEA{Tc) = 0). 

In short, while our data include long enough times to appreciate the behaviour 
qualitatively, they are not sufficiently clean for us to characterise the scaling be- 



haviour at the crossover point. In Section 10.8 we shall take on this problem again 



with equilibrium methods. We shall show how the crossover in the behaviour of 
dynamical heterogeneities can be interpreted as a proper phase transition and how 
its critical parameters, computed in equilibrium, can be used to characterise the 
dynamics. 
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Figure 10.17: Behaviour of the ratio C^i,2(C'^, fw)/^i,2(^w) as a function of for several 
values of tw We observe a crossover behaviour from a finite limit at small to a vanishing 
limit for large C^. Notice that for T = 0.7 we plot both our set with 63 longer samples and 
our set with 768 shorter ones. 



For now we address the simpler problem of the scaling of ^(C^, iw) in the low- 
sector. Since in this situation ^ and ^ differ only in a constant factor close to one, 
we consider the following ansatz 



,1/Zf 



^(C < i^EA' ^w) = A^fw , (10.43) 



where is expected to be similar to z. For each T, we have performed fits to ( 10.43 * 
for a value of expected to be below ijEA- We note that the value of q^A has 
consistently been overestimated in the literature, as we shall show in the following 
sections. In fact, in order to be safe we have simply fitted the lowest value of for 
which we could obtain an acceptable number of degrees of freedom in the fits. 



The results are shown in Table 10.4 where we can see that z^{T) is indeed 



similar to z(T), although we can measure it with much worse precision. 
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10.6 

The translationally invariant sector 



In Section |io.2| we left pending the question of computing the stationary part of 
the temporal correlation function. As we noted there, a naive extrapolation is 
complicated, because we have to consider the limit t^ 00, which is also strongly 
i-dependent. 

Here we address this problem, taking advantage of the characteristic lengths we 
have computed. In particular, we study the following dynamical variable 

^(^'^w) = \ ■ (10.44) 



Following the discussion of the previous section, the translationally invariant sector 
is recovered in the limit x — ^ (where x is essentially ^~^(iw) in its natural units 
for each i). However, this limit is much easier to compute in a controlled way than 
t^ 00. 

We have plotted C{t,tw) against x{t,tw) for our simulations at T = 0.7 in Fig- 
ure 10.18 We see that, indeed, the curve is very smooth in the small-x limit, and 
an extrapolation seems feasible (see Inset). Furthermore, the curves for different t 
become parallel as t grows, which suggests the existence of a smooth scaling func- 
tion C{t,tyf) = Coo(0 + f{x). In particular, we consider the following functional 
form 

C{t,x) = Coo(i) + ai{t)x + a2{t)x^. (10.45) 



However, a naive extrapolation using (1045]) is still problematic. The reason is that 



we have errors both for C and for x (i.e., for both the x and y coordinates in the 
fit). Luckily, however, the errors were similar in both directions and for all points. 



Therefore, we can use the method explained in Section B.3.2 We compute the fits 
for X < 0.5, using ^1^2 and ^1,2 as our estimates of the characteristic lengths. 

This method has a llowed us to compute Coo(0 with remarkable accuracy for 
^ ^ 10^ (Figure 10.19 1. Now that we have this function, we could try a second 
extrapolation to find q^A> 

lim Coo(0 = '?EA- (10.46) 

However, this second extrapolation turns out to be very difficult, because now we 
cannot avoid the issue of considering the infinite limit explicitly. Furthermore, we 
lack a convincing theoretical expectation as to the functional form of Coo(i)- 
The simplest option is a power law decay, 

Coo{t) = qEA + At~^. (10.47) 



This equation yielded very good fits for T = 0.6, 0.8 (Table 10.5 but the values of the 
exponents were very small, B ^ 0.05. The smallness of B makes the extrapolation 
extremely risky, since it means that q^A is going to be much smaller than our range 
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Figure 10.18: The temporal correlation function as a function of x{t,tw), Eq. 
an inset showing a close-up of the small-x limit (equivalent to an extrapolation to t 
Plot for our 768-sample simulation at T = 0.7. 



, with 
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Figure 10.19: Stationary part Coo(0 of the two-time correlation function C(f, fw) for T 



0.7,0.7,0.8, obtained from the extrapolations in Figure 10.18 
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T 


Fitting 


Log 


arithm 




Power law 






range 


CIt! a 
'7 nA 


xl/d.o.f 




B X 10^ 


xl/d.o.f 


0.6 


[10^, 10^] 
[W^, 10^] 


0.607(16) 

0.62(3) 

0.62(5) 


34.1/17 
7.23/14 
6.25/10 


0.730(8) 

0.733(14) 

0.726(24) 


5.7(4) 
5.8(7) 
5.4(12) 


31.2/17 
7.59/14 
6.32/10 


0.7 


[10^, 10^] 
[10^, 10^] 
[104, 10^] 


0.497(10) 
0.474(21) 
0.49(5) 


23.7/17 
18.9/14 
15.0/10 


0.656(5) 

0.637(11) 

0.63(3) 


6.16(18) 

5.5(3) 

5.4(9) 


32.6/17 
18.5/14 
15.3/10 


0.8 


[10^, 10^] 
[103,10^] 
[104,10^] 


0.371(13) 
0.368(24) 
0.40(6) 


6.50/17 
5.53/14 
4.31/10 


0.568(7) 

0.556(12) 

0.56(3) 


6.56(20) 

6.2(4) 

6.4(11) 


9.39/17 
4.27/14 
3.82/10 



Table 10.5: Estimate of q^A for three subcritical temperatures, using two different extrap- 
olating functions. For T = 0.6,0.8 both are very good, but at T = 0.7 (where we have 
better statistics) they are somewhat forced. This suggests that the real q^^ probably lies 
in between our two estimates. For the power law extrapolation, Eq. ( 10.47I / we also quote 
the exponent B. Notice that this exponent is not proportional to T. See Section 10.8 for a 
considerably more precise determination of this parameter. 



of values for Coo(i)- We note that this slow evolution has led some authors to 
overestimate q^A in the past (with larger errors. Coo seems almost constant). 

Perhaps more disquieting, it may seem that a low enough value of B would 
allow for the possibility that q-^A = 0. In order to dispel this notion, we have tried 
a second fit to a logarithmic decay, 

Ccoit) = qEA + gq:^ • (10-48) 

We stress that this ansatz lacks theoretical basis, we merely take it as a lower bound 
on the value of q^A- Numerically, it turns out, the logarithmic fit is as good as the 
exponential one for T = 0.6,0.8, even if it produces incompatible values of q^A 



(Table 105). 



Furthermore, if we try both fitting functions for our T = 0.7 data, where we 
have much better statistics, we find that they were somewhat forced. This leads us 
to conclude that the real asymptotic behaviour of Coo(i) is probably something in 
between ([1047 1 and (1048]). We can use the difference between both methods with 



a fitting window of f G [10"^, 10^] as our uncertainty interval, 

0.62 <qEA{T = 0.6) < 0.733, (10.49a) 
0.474 <qEA{T = 0.7) < 0.637, (10.49b) 
0.368 < qEAiT = 0.8) < 0.556. (10.49c) 

Even with our unprecedently long simulations, we are still at the threshold of being 



able to compute qEA with dynamical methods. In Section 10.8 we shall perform an 
equilibrium computation of this quantity. 
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10.7 

Equilibrium analogues and the time-length dictionary 

As we have seen, the non-equilibrium approach, while straightforward, is limited 
when one needs to perform some delicate analyses, such as the estimation of q^A- 
Therefore, one should complement it with equilibrium computations. Recall that 
we are working in a framework where the equilibrium phase is unreachable for 
experimental systems, yet conditions the non-equilibrium dynamics. Remember, 



for instance, the relation between the fluctuation-dissipation ratio of Figure 9.4 and 
the equilibrium p{q) [fra98| [FRA9 9]. 

In this section we take the qualitative statics-dynamics relation one step farther. 
Our proposal is that the equilibrium correlation functions computed for systems 
of finite size should reproduce their non-equilibrium counterparts in the thermo- 
dynamical limit for finite time. An infinite system with a finite coherence length 
^(^w) can, very roughly, be considered as a collection of systems of size L ^ ^(^w) 
in equilibrium, with some reservations. This relation should establish a time-length 
dictionary t^ ^ L, which is our objective in this section. 

In particular, we shall consider the equilibrium spatial autocorrelation, 

I 

^4 ('') = ]y E i^xqx+r) ■ (10.50) 



X 



We want to relate C4(r) to C2+2{i'>t,tw). As we have said, the waiting time 
iw can be related to L, but what of t? The answer lies in the, already exploited, 
one-to-one relation between t and the two-time overlap C{t,tw) at fixed In 
equilibrium, the dependence on C is replaced by the computation of correlation 



functions conditioned to a fixed value of q. Finally, we note that (10.50) is non- 
connected, unlike C2+2, as we defined it in ( io.i6[ >. Therefore, we shall consider 
here a non-connected version of C2+2, 



I 

€2+2(1', t,t^) = —YjCx{t,t^)Cx+r{t,t^) . (10.51) 

Our objective, then, for this section, is finding a relation between 

€2+2(1' 'KC{t,tw)),tw) < — >Ci{r\q). (10.52) 

We still have to provide a workable definition of the ^-conditioned C4:{r\q). In 

order to do this, let us step back for a moment and consider the pdf of the spin 
overlap. 



This definition is completely analogous to the pi (m) of (4.8 1 we considered in Chap- 
ter [3] Like in that case, the pi{q) is not smooth for finite systems, but rather the 
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sum of N + 1 Dirac deltas. Even if we are not considering a tethered formalism 
here, we can borrow the cure to this problem. In particular, we define our smooth 
ip{q) as the convolution of Tp\{q) with a Gaussian of width 1/ ^/N 



p{q = c) 



J_^dq' pi(q')GN{c - q') = (^^N (^c - ^J^q^c^ 




(10.54) 
(10.55) 



The pdf p{q) is an interesting observable in its own right, since, as we saw in Chap- 
ter I9I its thermodynamical limit is very different in the droplet and RSB pictures. 
We shall study it in detail in Section |ii.i[ 



Now, we can generalise our treatment of p(q) to define conditional expectation 
values at fixed q 



E{0\q 



Gn(c-^ Ex 

Of course, the standard expectation values can be recovered from the E{0\q), 

dq p{q)E{0\q). 



(10.56) 



O 



Finally, we define the fixed-t^ correlation function as 



(10.57) 



C^irlq) 



N 



YL^xq 



x+r 



1 



(10.58) 



In | |jANo8c| we compared our non-equilibrium C2^2(^ = 1/ ^' ^w) with the equi- 
librium C4(r = l\q), using equilibrium data from ||coNo6l [coN07Ai l, at T = 0.7. The 
result is plotted in Figure 



10.20 



We can see, how, indeed, the equilibrium correla- 
tion functions reproduce the non-equilibrium ones through a consistent time-length 
correspondence. More precisely, the non-equilibrium correlations at time 

can be 

matched to the equilibrium ones with the time-length dictionary 



Lit, 



3.7^i,2(^v 



(10.59) 



In ||jANo8c|| we used this equivalence to predict that t^ = 2'^^ would correspond to 
an L = 33 system. We note that, due to our discretisation of measuring times, we 
have only established this dictionary up to the nearest power of 2. 

In the previous analysis, we were limited by the equilibrium data, which only 
reached L = 20 and did not consider correlations for r > 1. We afterwards ran 
our own equilibrium simulations with Janus, reaching L = 32 and taking more 
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Figure 10.20: Plot of the off-equilibrium non-connected correlation C2^2('' = l/C^/^w) 
of Eq. ( lo.gil for several waiting times at T = 0.7. We compare it with the equilibrium 
C4(r = l\q), Eq. (10.58 using data from | jcoN07A i[ (continuous lines), finding that they are 
accurately matched through the time-length dictionary L 3.7^(iw). According to this, 
the correlation for = 2^^ would correspond to an L = 33 system. 



thorough measurements (see Appendix |E] for details on these simulations and ther- 
malisation checks). Our lowest temperature for our largest lattices was precisely 
T = 0.7. We simulated L = 8, 12, 16, 24, 32. We can now use this much more precise 
equilibrium simulation to improve on the conclusions of Figure 10.20 

According to the time-length dictionary, then, our L = 32 simulations would 
correspond to t^ ~ 2^^ and our L = 24 ones would correspond to t^ ~ 2^^. We 
have plotted both cases in Figure io.2i[ this time for several values of r, corrob- 



orating our prediction of jjANoScJ. Notice that in a finite lattice C4(r\q) can only 
be computed up to r = L/2, while the non-equilibrium C2^2(^' ^' ^w) is defined for 
arbitrary distance. However, we find a very good matching even for relatively large 
values of r. 



10.7.1 The experimental length scale 

As we can see, even relatively long times 2^2 MCS - 10^° MCS - 0.01 seconds, 
correspond to the equilibrium behaviour for quite small systems. This raises the 
question of the relevance of the thermodynamical limit for experimental physics. 
Indeed, for a typical experimental scale of 1 hour, that is, for a ~ 3.6 x 10^^ MCS, 



we can extrapolate the expected coherence length with equation (10.26 1 and the 
data in Table lio.il We conclude that 



L(iw) ^ 3.7^i,2(iw) 



1 hour i — > L = 110 (10.60) 
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Figure 10.21: Comparison of our non-equilibrium correlation functions C2_^_2{r,C'^,tw) 
(points) and our equilibrium C4^{r\q) (lines) at T = 0.7. In the top panel we compare 
L = 24 with = 2^^ and on the bottom one we consider L = 32 and t„ = 2^^. The errors 
in both data sets are comparable and smaller than the points. 
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That is, the equilibrium length scale relevant to the non-equilibrium experiments is 
not the thermodynamic al limit, but rather L ^ 100. 

This concept of experimental length scale will be extremely relevant in the next 
Chapter, where we try to elucidate the nature of the spin-glass phase. We shall find 
there that making extrapolations to L — > 00 is very difficult, but that extrapolating 
to L = 110 is safe. 

As a final comment, we point out that some authors have proposed a modified 
scaling for the coherence length, substituting the power law with a more complex 
functional form liBouoij . This was studied in Section 6 of | ijAN09Aj and found to 
fit our non-equilibrium simulations (the underlying reason being that 1 /z is very 
small and it is very difficult to distinguish a small power from a logarithm with 
numerical data). If this modified scaling were to hold instead of the power-law 
behaviour, the length scale equivalent to one hour would be smaller than L = 110. 



10.8 

The phase transition in the dynamical heterogeneities 

The previous section presented one of our most important results: the statics- 
dynamics equivalence can be made quantitative. We shall now take this observa- 
tion one step further and use the equilibrium correlation functions to understand 
the crossover in the behaviour of the dynamical heterogeneities. Recall that, for 
large C, the two-time correlation length ^(C, iw) reached a iw-tndependent value, 
while for small C it grew as the coherence length ^(iw)- 

Our dynamical study was precise enough for us to observe this phenomenon, 
but not to study it quantitatively. One of the major limiting factors was that the 
crossover value, q^A/ was very low, so even with our long simulations we did not 
have enough statistics in the low-C sector. However, one of the advantages of an 
equilibrium study is that we can cover the whole range q. To this end we can 
consider the behaviour of the connected correlation function at fixed q. Below q^A 
(and for T < Tc), one expects 

Ci{r\q) -q^ = + |^?| < ^?ea- (10.61) 

(10.62) 

The droplet and RSB pictures have very different predictions for the structure ex- 



ponent 6{q), which we shall study in detail in Section 11.3 For the moment, we 
only need that < 9{q) < D, on which both theories agree. In order to study 
these connected correlations in position space, one has to perform a subtraction 



that complicates the analysis (Section 11.3 and cf. |jcoN09|). Instead, in this Section 



we work in Fourier space. The scaling behaviour is now 

C4(fc|^?)ocfce('?)-D + ..., I^^l <q^A. (10.63) 
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Figure 10.22: Left: Plot of the dimensionless quantity Fq /Fq . Right: Same plot, but 
now we add factors of sin^(n7T/L) to avoid trivial free-field scaling corrections and plot 

Rq = [Fq^^ sin(7T/ L)] / [Fg^'' sin(27r/ L)] . In both cases the curves merge below q^A (as in 
a Kosterlitz-Thouless transition), but we cannot distinguish the critical point with much 
precision. 



On the other hand ]dom98[ ICON09] , 



This equation defines a correlation length - 
^ (jEA from above. 

In what follows we shall use the notation: 

fi"^ = Ci{nkmm\q), 



\q\> qEA- (10.64) 

:f. Eq. (5^ — which diverges when 



F, 



(1) 



(10.65) 



The two scalings ( 10.63 * ^^id ( 10.64 * for Q reproduce the crossover behaviour 
that we found when studying ^{C,tw)'- Fq ^ L^~^(l) for < q^A but f <j ^ 1 for 
I'?! > '^EA- In the large-L limit, the crossover becomes a phase transition where 



(10.66) 



Here v is a critical exponent, in principle different from the thermal critical expo- 
nent at Tc- This concept of a phase transition at T < Tc as we vary q may seem 
unorthodox for spin glasses. However, a very similar picture appears in the study 
of the equation of state for Heisenberg ferromagnets IBRE73] . 

We can study this phase transition using finite-size scaling, as we did for the 
Ising model and the DAFR Up to scaling corrections, we have 



fin) 



L°-^('?-)g„(l1/^(^-^ea)). 



(10.67) 
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Figure 10.23: Plot of Fq/U against for all our system sizes at T = 0.703 and two values 
of y. In the left panel we consider the free-field scaling Fq{k) cx k^^, that is, 1/ = 2. In the 
right panel we consider y = 2.35 D — 6{qEA) (see below). The insets are closeups of the 
crossing region and both are shown at the same scale. 



Notice that this approach also has close parallels with the condensation transition, 
which can be studied with Ising ferromagnetsj^] In what follows, just as in the 
previous section, we shall work at T = 0.703. 



As in previous chapters, we want to exploit (10.67 1 using phenomenological 
renormalisation. The first step in those analyses was finding some suitable dimen- 
sionless quantity and studying the intersections of its curves for different L. The 

only obvious such quantity that we have here is F^ ' /F^ ' . Unfortunately, these 
ratios do not show clear intersections. Indeed, in the case where 6{\q\ < ^ea) > 
the whole phase \q\ < q^A will be critical and we will be in a situation analogous 
to a Kosterlitz-Thouless transition [ KOS73] , where the curves merge, rather than 

suggests that this may be the case, but 



10.22 



intersect, at the critical point. Figure 
it is not possible to get a clear determination of q^A from these data. 

We have therefore adopted an alternative approach. Rather than looking for 
dimensionless quantities we consider the ratios Fq/U, where y is a continuous 
parameter. 



Fq/hy = L'-VGi {L'/%q - q^A)), e = D - d{q 



EAJ 



(10.68) 



Now, as long as y < D — 0(0), it follows that, in the large-L limit, F^/Ly vanishes 
for \q\ > q^A and diverges for \q\ < qEA- Therefore, the curves for these quantities 
as a function of q for different L will also intersect. We show two values of y in 
Figure 



10.23 



Now we can once again analyse these intersections with the quotients method. 
We consider pairs of lattices (L,sL). Then, operating with (10.68 1 we see that the 



5In this case, the magnetisation density plays the role of q, the spontaneous magnetisation that 
of q^A- Finally, 0(£Jea) = D and v = (D + 1)/D liBiso2l . The scaling function Gi(x) has, for this 
system, a discontinuity at finite x. 
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crossing points q^^^ scale as 



€y = + A-^L-^ + . . A- = , (10.69) 

where the dots denote corrections to leading scaling. A fit to this equation could 
produce the values of q-^A and of v. Recall that we tried, in vain, to estimate the 
former parameter using non-equilibrium methods. Notice that for an exponent 
y = e = D — 6{qEA) the intersection point is constant in L (neglecting scaling 
corrections, of course). 

Since we have simulated L = 8,12,16,24,32, our best option is to choose s = 2. 
In this case, for a fixed y, there will be three intersection points: (8,16), (12,24), 

(2) 

(16,32). However, there are also three fit parameters {qEA> and v), so we would 
be left with no degrees of freedom. 

We can get around this problem by considering n values of y at the same time. 
This way, we can make a joint fit of all the resulting intersections, with fit pa- 

(2) (2) 

rameters {qEA>'^> -^yi > ■ ■ •/ Ay„ } (that is, forcing the intersections for different y to 
extrapolate to the same q-^A arid with the same exponent). Such a procedure may 
(should) raise an alarm: we are taking two internally correlated curves and extract- 
ing many crossing points between them by varying a free parameter. Fortunately, 
the potentially dangerous effects of correlations can in this case be fully controlled 
by considering the complete covariance matrix. 

In particular, we have a set of measured data points {q^^y}, labelled by their L 
and their y. The sizes La are L\ = 8, L2 = 12 and L3 = 16, while the go from y\ 
to yn- Then, the appropriate chi-square estimator is 

!,/=l a,h=\ 

In this equation, o'(^ia){ib) is the covariance matrix of the data (we need two coor- 
dinates to identify each point: its L and its y). In this case, unlike in our non- 
equilibrium analysis, the full covariance matrix is treatable. 



Figure [10.24 shows the result of this fitting procedure. The selection of the num- 
ber of y is arbitrary: the more y we add, the more degrees of freedom (information) 
we have, but at the cost of increasing the effects of correlations and endangering the 
conditioning of the covariance matrix. Fortunately, what seems a delicate choice 
turns out to be arbitrary, both the fit parameters and their errors being remarkably 
resilient to changes in the yy. For our choice of (shown in Figure io.24[ >, the final 
results are 

^?EA = 0.52(3), l/i? = 0.39(5), (10.71) 

with ;\;^/d.o.f. = 18.9/16. Notice that our value of q^A is consistent with the 
(wide) bounds that we obtained in our preliminary non-equilibrium analysis in 
Section lio.6[ 
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(2) 

Figure 10.24: Crossing points q\^y computed for pairs of lattices (L,2L) and several values 
of y, against L^^^^. The continuous lines are fits to (10.691 constrained to yield common 
values of v and q^A- 
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Figure 10.25: Scaling plot of the quantity Rq of Figure 10.22 using the critical parameters 
computed in (lo./ib. 
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As pointed out above, the result is very stable to changes in the values of y. 

For instance, removing the L = 8 data for y = 2.3,2.4 (the outliers) shifts our final 

I I (2) 

values by one fifth of the error bar. Notice that, from Figure 10.24 Ay changes 
sign at y !=s 2.35, so 6{q-EA) ~ 0.65. We shall return to this exponent in the next 
Chapter. 

As a test of these critical parameters, we can attempt to produce a scaling plot of 
the ratio Rq that we had plotted in Figure 10.25 This is represented in Figure 10.25 



which shows that, indeed, a good collapse is obtained. 

Let us finally note that our value of 1/v = 0.39(5) is compatible with the best 
known result for the thermal critical exponent 1/v = 0.408(25) [haso8b | | , so it may 
be that the two are equal after all. We shall consider other conjectures for 1/v and, 
in particular, its relation with 6{q), in Section [11. 3} 



10.9 

The finite-time scaling paradigm 

In the previous section we obtained an important result: the spin-glass order pa- 
rameter t^EA- This had eluded us in our previous non-equilibrium study (Sec- 
tion 10.6 1 and had also never been done in a controlled way in equilibriumH 

We also saw how the critical parameters thus obtained could produce scaling 
plots of dimensionless observables in equilibrium. 

We can now consider a complementary problem. Given the equivalence be- 
tween L and ^(^w) through the time-length dictionary, should it not be possible to 



produce a dynamical scaling plot? The answer turns out to be yes. In Figure [10.26 
we have plotted the dimensionless ratio ^(C, iw)/^(iw) as a function of the scal- 
ing variable (C^ — q^j^J^itwY^^ ■ The values of v and q^A used were directly those 



of (10.71 ). As we can see, we obtain a collapse of the curves for large t^ 



Figure 10.26 suggests that the correct manner to treat the statics-dynamics equiv- 



alence in a fully quantitative way is to adopt a 'finite-time scaling' (FTS) formalism. 

The FTS approach provides a natural explanation for the extremely small expo- 
nents found in our power-law extrapolation of Coo(0 (Section io.6[ >. Indeed, FTS 
implies that 



Cco(0 -^?EA OC^co(0 



-1/v 



U{t) = lim Cit.U 



(10.72) 



Finally, we expect 



^00 (0 oc i^?. 



(10.73) 



and, in fact, (see Table 10.4 1 the evidence suggests that is equal to the dynamic 



^One naive way to estimate q^A is to consider the evolution of the peaks in the p{q) as L grows. 
However, not knowing the exponent v that controls this finite-size evolution we would not have 
enough degrees of freedom for a reasonable fit. 
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Figure 10.26: Finite-time scaling plot: dimensionless ratio ^(C, iw)/^(^w) against the scal- 
ing variable {C^ - ql^)^{t^f'\ 



critical exponent z. Therefore, the exponent B of Section 10.6 

Coo(0 = q^A + At-^, 



(10-47 > 



IS 



1 



zv 



0.0325. 



(10.74) 



We have used the value of z(T = 0.7) = 12.03(27) from Table 10.1 This agrees at 



least in order of magnitude w^ith the results of Table 10.5 (which is about as much 
as we could have hoped for, given the problems in estimating B from dynamical 
data). 

We finally note that recent improvements in the experimental measurement of 
response function with space-time resolution suggest that l,{t,t^) may soon be 
experimentally accessible [lOUKioj . Therefore, our finite-time scaling framework 
has potential implications for experimental work. 



CHAPTER XI 



The structure of the D = 3 spin-glass phase 



In this chapter we consider the finer details of the structure of the D = 3 spin-glass 
phase. In particular, we try to decide which of the competing theories (RSB, droplet, 
TNT) offers a best description. To this end, we begin by examining the would-be 
triviality in the spin and link overlaps which, as we explained in Chapter [gj is the 
main differentiating characteristic between the three pictures. 

This analysis is carried out on the light of our previous results for the statics- 
dynamics equivalence. In particular, we stress the fact that the theory with most 
experimental relevance is the effective one at L ^ 100, rather than the one that 
best describes the thermodynamical limit. In this sense, we shall conclude that the 
RSB framework provides the best picture of the experimental spin glass. Still, this 
large-L limit is also interesting from a theoretical point of view. Therefore, we also 
attempt infinite-volume extrapolations, carefully checking for finite-size effects. 

We conclude this chapter (and our study of spin glasses) with a detailed analysis 
of the structure of correlations in the spin-glass phase. 



11.1 

The spin overlap 

We begin our study by considering the most straightforward observable: the spin 
overlap q. As we explained in Chapter |9| the behaviour of this quantity is one 
of the clearest markers to distinguish the droplet picture from the RSB onej^ In 
particular, droplet predicts a trivial p{q) in the thermodynamical limit (two deltas 
at ±i^EA), while RSB predicts a non-trivial behaviour (non-zero probability density 
for all \q\ < q^A)- 



We have plotted the p{q) of Eq. (10.54), ^ Figure [11^ for T = 0.703 (the lowest 



^The TNT picture coincides with RSB in the predictions for this observable, but we shall not 
mention it explicitly in this Section. 
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Figure ii.i: Overlap probability density function p{q), Eq. (10.54I, at T = 0.625 and T = 
0.703. Notice that for the central sector of ~ the curves for the different system sizes 
quickly reach a plateau with p{q) > 0. 
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T T 

Figure 11.2: Overlap density distribution function at zero overlap as a function of tem- 
perature. We observe an enveloping curve with a linear behaviour, as expected in an RSB 
setting. 



temperature for L = 32) and for T = 0.625 (the lowest for L = 24). The curves 
are very smooth, thanks to the Gaussian convolution of (10.54). According to the 
previously mentioned theoretical predictions, one would expect the peaks to get 
narrower and closer together as L grows. The shift in position is very clear (recall 
from the previous Chapter that q^A at T = 0.703 is ^ 0.52) but a more careful 



11.1. 2D 



analysis is needed to assess the change in width (Section 

In addition, the probability density at q = shows no evolution, favouring the 
RSB scenario. In order to see it more clearly we have plotted p{q = 0) at T = 0.703 
in Figure 11.2 for all our lattices. 



We can see a clear non-zero enveloping curve. 
More precisely, in a mean-field setting one expects this probability density to be 
linear in T below | |mez87 [. This expectation is checked in the right panel of 
Figure 
for L 



11.2 



The seemingly out of control value of p(0) at the lowest temperature 
8 is an artifact of the binary nature of the couplings (a finite system always 
has a finite energy gap). The fact that finite-size effects in p(0) are stronger close to 
T = than at finite temperature has been studied in liPALOij . 

From a droplet model point of view, Moore et al. [' MOogSp have argued that the 
apparent lack of a vanishing limit for p(0) in numerical work in the 1990s was an 
artifact of critical fluctuations. In fact, at Tc, p(0) diverges as L^^^ while droplet 
theory predicts that, for very large lattices, it vanishes with the stiffness exponent 
y ~ 0.2 as L~y, for all T < Tq. These authors rationalise the numerical findings 
as a crossover between these two limiting behaviours. However, a numerical study 
at very low temperatures (so the critical regime is avoided) found for moderate 
system sizes a non- vanishing p(0) [ jKATOil . Furthermore, we shall compute in Sec- 
tion 11.1.2 a characteristic length for finite-size effects in the spin-glass phase, which 
turns out to be small at T = 0.703. 

In any case, the behaviour at the experimentally relevant scale of L ^ 100 seems 
without a doubt to be non-trivial (there is no room for a change of regime between 
our simulations and L ^ 100). 
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Figure 11.3: Infinite volume extrapolation of the Binder parameter at T = 0.703 and T 
0.625 and fits to the behaviour expected in the RSB, Eq. (11.3b I, and droplet, Eq. (ii^ajl. 



pictures (cf. Table |ii.i| . For the experimentally relevant scale of L = 110 (dotted vertical 
line), both fits are well above the B = 1 value of a coarsening system. 



T 




Droplet fit 






RSB fit 








a 


y 


X^/d.o.f. 


c 


d 


0.703 
0.625 


3.78/3 
2.00/2 


0.312(17) 
0.289(16) 


0.110(17) 
0.134(21) 


3.44/3 
2.73/2 


1.165(12)[34] 
1.128(11)[33] 


0.186(34) [03] 
0.193(28) [03] 



Table 11.1: Scaling of the Binder parameter and fit to the behaviour expected in the droplet, 
Eq. ( ii.3a| , and RSB pictures, Eq. ( ii.3b| . 



ii.i.i The Binder cumulant 

We can define the Binder ratio for spin glasses just as we did for the Ising ferro- 



magnet in (5.10) 



BiT) = ^. (11.1) 
if) 

Notice that we compute these moments from the original piicj), not from our 
smoothed version. Above the fluctuations of q are expected to be Gaussian 
in the large- L limit, hence 

lim B(T) =3, T > T^. (11.2) 

Below Tc the situation of course depends on whether the droplet or the RSB pictures 
are correct. In the former, B{T < Tc) should approach 1 in the large-L limit (as in 
a ferromagnet). In the latter, one expects 1 < B < 3. 
Therefore, we have the following expectations 

Droplet : B{T; L) = 1 + al'V, (ii-3a) 
RSB: B{T;L) = c + dL-'^^\ (11.3b) 
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Figure 11.4: Left: q^p^iL) as a function of the temperature. We include two different infinite- 
volume extrapolations. The first uses the dynamical heterogeneities exponent \ /v, cf. ( 11.3I 
and ( [11.3 1. The second is obtained from finite-size scaling arguments in the critical region, 
Eqs. ( ii.io| and ( 11.13I . Right: Width of the peaks of p{q), Eq. ( 11.4I 1, as a function of T for 
all our lattice sizes. 



The droplet prediction ( ii.3a[ > depends on the stiffness exponent y, while in the 
RSB picture the finite-size evolution is controlled by the exponent l/v = 0.39(5), 
which we computed in the previous Chapter. 

We have plotted the Binder ratio for T = 0.703 and T = 0.625 in Figure 11.3 



along with fits to (11.3 1. The resulting parameters are gathered in Table 11.1 For 
the RSB fit we include two error bars: the number enclosed in (•) comes from the 
statistical error in a fit with fixed v and the one in [•] is the systematic error due to 
our uncertainty in v. 

As it turns out, both fits have acceptable values of x^/ d.o.f.. However, the evo- 
lution of B with L is very slow, so in order to accommodate the droplet behaviour 
we have needed a very small exponent y ~ 0.12, smaller than the usual droplet 
prediction oi y ^ 0.2 |bra87[ . 

Let us finally note that, again, the extrapolation at L ^ 100 is well above one 
even with the droplet scaling. 



11.1.2 The peaks of p{q) 

As we have seen, the droplet and RSB pictures have very different predictions for 
p{q) as a whole. However, they both agree in that the two symmetric peaks ob- 
served at finite L should eventually become two Dirac deltas at ±qEA> a prediction 
that we can check. 

Let us begin by defining i^EA(^) as the position of the maximum of p{cj;L) 
(in the remainder of this section, we consider all overlaps positive and the pdf 
symmetrised). Again, our Gaussian smoothing procedure makes this quantity easy 
to compute (we fit the neighbourhood of the peak to a third-order polynomial, since 
the peak is very asymmetric). 

In order to test that the peak not only tends to q^A, but also gets infinitely 
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L 


T 


= 0.703 


T 


= 0.805 




(7 




(J 


C^p('?EA(i')) 


8 


0.1177(20) 


0.1784(10) 


0.1391(25) 


0.1833(10) 


12 


0.0963(21) 


0.1740(12) 


0.1165(25) 


0.1809(12) 


16 


0.081 7(16) 


0.1696(11) 


0.1001(22) 


0.1756(11) 


24 


0.0735(16) 


0.1690(12) 


0.0860(19) 


0.1728(12) 


32 


0.0668(29) 


0.1631(23) 


0.0798(34) 


0.1669(22) 




16 




16 




;;tVd.o.f. 


0.43/1 




1.13/1 




B 


-0.278(28) 




-0.346(30) 





Table 11.2: Width a = ((7^+) + )) /2 of the peaks in p{q) and fit to a power law cr{L) = 
AL^ in the range [Lmin/32]. We also include the product ap [q^Ai^-)) ■ 



narrow, we can employ the half-widths at half height. Defining q^^") through 

p(i^(±)) = p[q^p^{L)) / 2, we have 



a 



(±) 



(±) 



^ea(£')| 



(11.4) 



Notice that, since we are considering the positive peak, q^~^ < qEAiQ < q^'^'' (that 
is, q^~^ is the inner width and q^~^^ the outer). 

We have plotted q-EAi^) and tr^^) in the left and right panels of Figure 
respectively. We see that the width of the peaks does decrease slo wly w ith L 
However, the product (rp((^EA(i-)) has a small dependence on L (Table 
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We can now consider the actual value of q^^- Following Section 10.8 we expect 



11.2 



1 + 



(11-5) 



We cannot perform a three-parameter fit to (11.5), due to the lack of degrees of 
freedom, but we need to use our previously computed value of 1/v = 0.39(5). 
Similar extrapolations were attempted in | ini96[ , but with less control orvl/v (and 
smaller sizes, L < 16). 

We present the values of qEA{^) arid the result of a fit to Eq. ( 11.5 1 in Table |ii.3[ 
As we can see, the errors due to the uncertainty in the exponent, denoted by [ • ], are 
greater than those caused by the statistical error in the individual points, ( • ) . In 
fact, our data admit good fits for a very wide range of values in 1/v. For instance, 
if we try to input the value of the stiffness exponent obtained in the droplet-like 
extrapolation of the Binder parameter, y ^ 0.12 (see Eq. (11.3a) and Table [iTT] >, we 
still obtain a good fit, even though the extrapolated value for q^A is almost zero at 
T = 0.703 and negative at T = 0.805. Therefore, using the droplet exponent y the 
spin-glass phase would be non-existent. 

Also included in Table [11.3 is the confidence interval for this observable com- 
puted from non-equilibrium considerations in Section 10.6 Notice that the equi- 
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0.703 



0.805 



8 

12 
16 
24 

32 



0.82461(83) 
0.793 33(85) 
0.773 00(75) 
0.740 27(71) 
0.7174(14) 



0.7818(11) 

0.7412(11) 

0.71681(95) 

0.67905(83) 

0.6535(16) 



Section 10.^ 



16 

1.83/1 

0.538[22](6) 

0.474 < qBA < 0.637 



16 

0.98/1 

0.447[24](6) 

0.368 < qEA < 0.556 



Table 11.3: Extrapolation to infinite volume of qEA{l-,T) using the replicon exponent, 
Eq. ( ii.3| . We also include the confidence interval previously obtained in the non- 
equilibrium study of Section 10.6 The resulting estimate for T = 0.703 is compatible with 
our previous determination from the phase transition in the dynamical heterogeneities in 
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T 






Lc(Te-T)^ 


0.703 


1.253[20](32) 


1.78[8](11) 


0.197[8](13) 


0.75 


1.448 [24] (34) 


2.58[12](16) 


0.210[8](13) 


0.805 


1.731[28](44) 


4.08[18](27) 


0.221[10](15) 


0.85 


2.023[32](54) 


6.09[26](42) 


0.222[10](15) 


0.90 


2.514[41](66) 


10.63[44](71) 


0.230[10](15) 



Table 11.4: Determination of from the fits to Eq. (11.5 1, using (11.81, for several temper 
atures below T, 



11.3 



Errors are given as in Table 
a correlation length when T approaches (v 

that the x^/d.o.i. for the fits at T = 0.85 and 0.90 are, respectively, 2.6/1 and 2.7/1. 



The characteristic length ( T) scales as 
2.45 from | haso8b |). We warn the reader 



librium values are much more precise, but consistent. The extrapolations included 



in this table (and analogous ones for other values of T) are plotted in Figure 11.4 



Finally, notice that the estimate of cj-^AiT = 0.703) in Table 11.3 is compatible with 
our value of q^AiT = 0.703) = 0.52(3), obtained in Section 10.8 with a completely 
different method. 



Critical and finite-size effects 

In order for our extrapolations to the thermodynamical limit to be meaningful, 
we have to check that we are not in a preasymptotic regime dominated by critical 
fluctuations. This is best done through a finite-size scaling analysis of the p{ci). 

Let us first show that our estimate of q^A provides a determination of the corre- 
lation length in the spin-glass phase. Notice that we are in a situation were the cor- 
relations decay algebraically, so the concept of correlation length is delicate [ jos66| . 
In particular, finite-size effects are ruled by a crossover length Lc(T), scaling as a 
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correlation length: Lc oc (Tc — T) In fact, one would expect 



qEA{T) 



l + h[L/L,{T)]. (11.6) 



We do not know the complete crossover function h, but we do know that for large 
X it should behave as h{x) ^ x~^^^, so we make the simplest ansatz, 

h(x) = x-^l\ (11.7) 



Then, if we compare (11.6) with ( ii.5[ >, we see that the amplitude of the finite-size 



corrections in the latter can be written as 

A(T) = [Lc(T)]i/^ (11.8) 



The resulting values of Lc can be seen in Table 11.4 Notice that this crossover length 
does scale with temperature as expected for the bulk behaviour, which constitutes 
a weighty argument for the asymptotic nature of our results. 

In order for our previous extrapolations for r^gA to be valid, we need to stay in 
a temperature regime where L ^ Lc(T), a condition that is amply satisfied for our 
working temperatures of T = 0.703 and T = 0.805. 

In the region close to Tc, where L becomes smaller than Lc(T), we can use finite- 
size scaling to extrapolate cj^AiT)- This somewhat unconventional use of finite-size 
scaling was started in fLUEgi , K1M93], ICAR93A} ICAR93B] and has also been used in 
the spin-glass context fFAL99^ j6ro6J . Most of the times, these ideas are used in 
the paramagnetic phase, but we show below how to implement them in the low- 
temperature phase. 

Close to Tc, we know that 

q^^{T) = \{T, - r)/^[l + ^(Tc - + ...]. (11.9) 

We have excellent determinations of Tc and jS from the work in ||haso8b 1, so we need 
only estimate the amplitude A. In fact, Wegner's confluent corrections (Tc — T)^^ 
are small close to Tc. To proceed, we note that finite-size scaling tells us that 

^?ea(L, T) = L-/^/^f(x)[l + L-'^G(x) + ...], X = l1/^(Tc - T), (11.10) 

where the critical exponents are (from ||haso 8b1), 

v = 2.45(15), |6 = 0.77(5), a; = 1.0(1). (n-n) 



In order to connect Eq. ( ii.io[ > with the infinite-volume limit in Eq. ( [11.9 1 the asymp 



totic behaviour of the scaling functions F(x) and G(x) must be for large x 

F{x) - x^ G{x) - x'^\ (11.12) 



The resulting scaling plot is represented in Figure 11.5 Varying the values of 
Tc and the critical exponents inside their error margins does not make significant 
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Figure 11.5: Scaling plot of y = q'EA{l-fT)L^^^ in the critical region below T^, following 
Eq. ( ii.io| and using the values given in P haso8b( for the critical exponents and T^. In- 
set: Close-up of the region near in the representation of Eq. ( 11.13I 1, showing a linear 
behaviour for large L. 



changes in the plot. Notice how the curves collapse for small values of the scaling 
variable x and large L, but how for our lowest temperatures scaling corrections 



become important. In fact, Eq. (11.10) implies that when the temperature is lowered 
away from Tc the amplitude for scaling corrections grows as x^^ ~ x^-^^. 

In order to estimate the amplitude A we shall concentrate on the small-x region 
where finite-size scaling corrections are smallest. Disregarding scaling corrections 



in (11.101, 



{qML,T)L^/^) 



1//3 



F(x)i/^ X. 



(11.13) 



The inset of Figure 11.5 shows that we reach this asymptotic behaviour for L > 24. 
Then, using the simplest parameterisation, F(x) = (A^^^x + B)^, 



^?ea(L,T) = A(rc-T 



1 + 



/3B 



\U^(Tc-T)L^ 



Iv 



+ 



(11.14) 



We can fit our L = 32 data for x < 0.4 (where the curves for L = 24 and L = 32 are 



compatible) and use the resulting value of A to extrapolate in Eq. (11.14) to infinite 
volume. This extrapolation is represented as a function of T in Figure 11.4 It is 



clear that this critical extrapolation differs with the extrapolation from ( 11.5 1 at most 



by two standard deviations. The difference, if any, could be explained as Wegner's 
confluent corrections. However, to make any strong claim on confluent corrections, 
one would need to estimate the error in the critical extrapolation. Unfortunately, 
we have found that this error estimate is quite sensitive to the statistical correlation 
between Tc, v , and j6 (as far as we know, the corresponding covariance matrix has 
not been published). 
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One could be tempted to compare Eq. (11.14) with Eq. (11.5) and conclude 
V = V. In the previous Chapter we observed that, at the numerical level, v = 
2.45(15) IIhasoSbII and v = 2.6(3). However, we do not regard this as fireproof. 
Indeed, it is a consequence of our somewhat arbitrary parameterisation F(x) = 
(A^/^x + B)^. To investigate this issue further, the small-x region is not enough. 
One is interested in the asymptotic behaviour of F{x) for large x where unfortu- 
nately corrections to scaling are crucial. A careful study of the crossover region can 
be done only by considering corrections to scaling both at the critical temperature 
(at q = 0) and below the critical temperature (at q = q^A)- 

Finally, the reader could worry about the applicability of ( ii.9[ > well below 
Tc- The issue has been considered recently within the framework of droplet the- 
ory liMOOioil . It was found that is adequate for all T < Tc (actually, no 
Wegner's scaling corrections were discussed in liMOOioil ). Thus, the fact that our 
data are describable as scaling behaviour with leading Wegner's correction does 
not imply that they are not representative of the low-temperature phase. 



11.2 

The link overlap and overlap equivalence 

In the previous section we showed that our numerical simulations favour a non- 
trivial scenario for the spin overlap, at least for experimentally relevant scales. This 
is strong evidence against the droplet picture of the spin-glass phase, but still leaves 
undecided the issue of TNT vs. RSB. 

The difference between these two theoretical descriptions is best examined through 

I I (1) 
the link overlap, introduced in Section 9.2.3 In the notation of real replicas {si } 

(2) 

and {sx }, this is 

^ {x,y) 

Notice that in a system of D spatial dimensions, the number of links is N\ = ND. 
Also, 

C4(r = l) = (Qlink)- (11-16) 

The link overlap is arguably a better fundamental quantity to describe the spin- 
glass phase in D = 3 than q jMAR99[ , |cono3A[ , |cono6I| . First, as was explained in 



Section 9.2.3 it is more sensitive than q to the differences between RSB and the other 
theoretical scenarios for the spin-glass phase (RSB expects a non-trivial behaviour 
for Qlink/ which is trivial in both the droplet and TNT pictures). Second, while 
Qlink is just q^ in the mean-field model, the link overlap is actually more amenable 
than q to an analytical treatment in D = 3 (within the RSB framework) jcoN03f 
|CON03B[ , ICON07A] . In particular, Qunk is a more convenient quantity to study some 
properties of the spin-glass phase such as replica equivalence I^RgSj iparooI or 
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ultrametricity [i cono7b] |^ 

In short, for RSB system we expect to have overlap equivalence: fixing should 
also fix Qiink/ even if the relation between these two observables in D = 3 is some- 
thing other than an identity|3] 

In this section we shall study the issue of overlap equivalence using both a 
non-equilibrium and an equilibrium approach. 



II.2.I Non-equilibrium study 

From a non-equilibrium point of view, we can consider the two-time link overlap 

\ 

Qmk(i/iw) = YL Cx{t,tw)Cy{t,tv,), (11-17) 



where Cx{t,tw) was defined in ( lo./] ). Using the usual substitution of for t 



as independent variable, we can explore the would-be differences in the aging of 
the spin and link overlap with this equations and, hence, distinguish between the 
coarsening dynamics of droplet and TNT and the non-coarsening evolution of RSB. 
The analogue to Eq. ( io.io[ > is now 



(Qlink) = lim lim CurvkitJw) = lim lim Ciink(i(C ,tw),tw)- (11.18) 

In particular, for a coarsening system, Ciink(C^, iw) should be independent of 
for C2 < q^j^ and large tw Indeed, in this regime, the relevant system excitations 
are the reversal of coherent domains where the overlap is already q^A (as low 
as it can get, since there are no states for smaller \q\). Since these domains have a 



vanishing surface-to-volume ratio (cf. Section 9.2.3 ), these excitations do not induce 
a finite change in Cii^k/ even if they do modify C^. Therefore, in the large- limit. 
Clink should reach its equilibrium value (Qiin k) at = q^^ and not decrease any 



further. This behaviour is illustrated in Figure 11.6 showing the Ciink(C , fw) of the 
D = 2 Ising model (the most straightforward coarsening system). 

On the other hand, the overlap equivalence property of the RSB picture trans- 
lates into an equal aging for C and Cynk- Here, when decreases below q^j^ it 
is because new states are constantly being found, involving a change of a domain 
with a volume-filling surface and, hence, changing C^nk- Therefore, Cynk is a non- 
constant function of for the whole range (again, for the Sherrtngton-Kirkpatrick 
model. Clink = C^). 

Let us see where our simulations stand. We have plotted the Ciink(C^, iw) of the 



Edwards- Anderson spin glass at T = 0.6 in Figure 11.7 The behaviour at our finite 
times seems to reproduce the RSB prediction. Still, the slope of the curve decreases 
(slowly) when increases. 



^We note that the replica equivalence property is studied in Section 7.2 of [ jjANioA| using Janus' 
equilibrium simulations. We plan to study the issue of ultrametricity in a future work [ jjANXxl 

3 In this respect we recall a computation in a finite-connectivity mean-field model that yielded 
Qlmk = aq^ + h [ferioI . 
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Figure 11.6: Plot of Ciink(C^/fw) for the D = 2 Ising model at T = 0.66Tc (results from 
simulations of an L = 4096 system, averaged over 20 thermal histories). Below = q^p^ 
the function approaches a C-independent value for long times — here qEA = where my 



is Yang's magnetisation (5.13 1. 



0.8 




0.0 



0.2 



C 



2 0.4 




0.6 



Figure 11.7: Plot of Cij^{C^,tw) for our spin-glass simulations at T = 0.6. The behaviour 
for our simulated times reproduces a non-coarsening behaviour, even if the slope decreases 
with increasing f„. We also show an extrapolation for a typical experimental time of 100 s. 



which is still far from the coarsening behaviour of Figure 11.6 
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Figure 11.8: Derivative of the link correlation function, regarded as a function of C^, for 
our spin-glass simulations at T = 0.6. For fixed C^, the derivative decreases with time but 
an extrapolation to an experimental scale of 100 s is well above zero (grey line, the width 
represents our error bars). 



In order to analyse this behaviour more quantitatively, we have interpolated the 
link correlation function at fixed with the lowest-order polynomial that allowed a 
fair fit (seventh order for iw ^ 2'^^, sixth for longer times, where we cover a smaller 

range). We then computed dCiink/d(C^) by differentiating the interpolating 
polynomial (Figure ii.8[ >. 



Notice that for small the derivative seems to reach a plateau for a widening 
range (suggesting that the relation between and becomes linear, as in 
the mean-field model)j4] 

Now, we would like to extrapolate these correlation functions and their deriva- 
tives to an experimentally relevant time scale, to see whether or not they reach the 
coarsening behaviour. In order to do this, we consider C{rt^, ty^) and C\\^\^{rty^, 



for r = 8,4, . . . , ^ and fit each of these functions to fl^ + ^r^w'^' • The parameters 
are stable to variations in the fitting range, as long as > 10^ and Cr ~ 0.5. 
We then use the fits to extrapolate both t he link and spin correlation functions to 
= 10^'^ ^ 100 s. Additionally, recalling ( |ii.i8 >, the limiting behaviour for = 



is estimated by extrapolating Qiink(^w) to the same experimen tal tim ej^ The result 
ing Ciink(C^, iw = lO^"*) is plotted with black crosses in Figure The computed 
points fall on a straight line, whose slope we can use to estimate the derivative 



dQink/ d(C^) |f^^;^oi4. This is plotted in Figure 11.8 with a thick grey line (the width 
of the line represents our error interval). 



4For T = 0.6 we lack a good determination of c^ea but, using the logarithmic extrapolation of 



Section 
5We 



10.6 



as a lower bound, we expect q^j^ > 0.4. 



did not include this point in the original study in ||jano8c| . 
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In short, the behaviour at a typical experimental time is still far from that of a 
coarsening system. One could be tempted to use the same procedure to extrapolate 
to iw 00, but the resulting errors are so large as to permit any kind of behaviour. 

We need some more theoretical input on the scaling of Cij^k in the coarsening 
scenario in order to consider the infinite-time extrapolation. 

We consider a large droplet of size ^{t + t^/) at time t + t^ that, at time tw, was 
made of Afc smaller droplets of size ^(^w)- The number of spins in the droplet's 
boundary scales as ^(^w)^*"- In a simple coarsening system (such as a ferromagnet), 
Ds = D — 1, but we can actually have D — 1 < Dg < D | |fis88aI |mcm84[ |bra87[ 



Ii'Is86b^Ifis88b| . In fact, the numerical simulations carried out in the TNT framework 
expect D — Dg ~ 0.44 [ipalooi |PAL03i l in D = 3. 

The scaling of A/c is straightforward: A/c ^ [^(i + iw)/^(iw)]^- The overlap of 
each of the droplets at time t^, with the combined droplet at i + iw is randomly 
it^EA- Therefore, one expects 



C(t, t^ 



at. 



D 



at. 



at+t. 



D/2 



(11.19) 



This equation is intuitively evident, but we note that for an Istng ferromagnet (the 
paragon of coarsening models) it can be actually obtained with an explicit compu- 
tation. Indeed, for an Istng model, it is shown in jcuG94B] can be written as a series 
in a parameter y, where 



[at+t^r+at^)f^^ 



(11.20) 



Equation (11.19I then follows in the ^{t + t^) ^ ^(^w) limit. 
As to the link overlap, we expect 



Qink(^/ 1^ 



Cfink+M 



C 



at. 



at+t. 



(11.21) 



That is. Clink cannot decrease below some finite value C^^^^, which is nothing but 
the equilibrium value (Qiink)- The excess over this value at finite t comes from the 
probability that a link belong to the surface of a droplet at time 



We can now rewrite (11.19 1 in the form 



c 



gjC) 

C2 



(11.22) 



where g{x) is some continuous positive function, not necessarily differentiable at 
X = 0. Combining the previous expressions we can rewrite ( ii.2i[ ) as 



Ciinkit, iw) = Cfi^k + Cl^^g{C)at. 



(11.23) 



This equation explains the behaviour observed in Figure 11.6 for the Ising ferro 
magnet. For this system, the D — Dg = 1, so we would expect C\[j^{C^,tv, 



cO 

•^link 
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Figure 11.9: Check of (11.23 ' for the Ising model (the vertical line is C = ^Jea)- 



to scale with ^ . In Figure 11.9 we can see that this is indeed the case — ^notice 



that for the I sing m odel the coherence length scales as t\^^ see, e.g., |jBRA94|. 



Equation ( 11.23 * su ggests plotting dQink/ d(C'^) |c=c* against ^ ^ ( 
QM(^ \ ^ggg Figure 11.10). It is important to choose a value Cl below q-^A but 



[t^) and against 



not too small, because otherwise the numerical derivative would be unreliable for 
lack of data. Therefore, we have chosen values smaller than, but close to, our lower 



bound for t^EA in Section 10.6 The two representations are linear within our errors. 
However, while a ^~^(iw) scaling compatible with standard coarsening seems fal- 
sified (i.e., its extrapolation is well above zero), the TNT scaling of ^~*^ '*^(iw) does 
extrapolate close to zero with our data. 

It is also interesting to attempt a similar representation to Figure 



11.9 



for the 



Edwards-Anderson model. Unlike the Ising case, now we do not know the exact 
value of Cjl^j^. However, by definition. 



lim Qlink(^ 



(Q 



link/ 



•^link- 



(11.24) 



Of course, Qiink(^w) = C/^{r = l,tv,). Then, following (11.231, in a coarsening system 



Qink(C^, ^w) - C4(r = l, ^w) ^ ^{U 



^D,-D 



(11.25) 



We have plotted this difference in Figure 



11.11 



for a value of C* below our lower 
bound for q^A in Section |io.6[ Again, we represent both the standard scaling 
and the TNT scaling of ^^^^ pj-om the first one, it is easy to find a smooth 
extrapolation to a p ositive difference. However, now the data are more precise 
than in Figure 
only be possib 



ii.io| and the TNT scaling of ^ ^■^'^ does not fit our data and would 



e if the whole of our simulation were in a preasymptotic regime. 
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Figure ii.io: Plot of dCiink/d(C^) against for Dg = 2 (standard coarsening, left) and 

Dg — 2.55 (TNT scaling, right). We compute the derivatives at a value of C* below our 
lowest estimate for (^ea at each temperature. We also mark by crosses our extrapolations to 
an experimental scale of 100 s. 




Figure ii.ii: Difference Ciink(C ^/^w) — Ci{r = l,t„) against ^ using the same values 



for C* and Dg as in Figure 
case. 



11.10 



An extrapolation to zero seems unlikely even in the ^ 



-0.44 
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There is another problem with the TNT scaling and it is that (11.231 relies 
on (11.191, ^ri equation that is disproved by our computation of exponent d in 
— we obtained d ~ 2, rather the = 3/2 compatible with (11.191. 



10.4 



Section 

In short, the infinite-time extrapolations remain inconclusive, leaving room for 
a TNT-like coarsening behaviour. However, in all cases the extrapolations to the 
experimentally relevant time scale, much safer, show that the dynamics is certainly 
non-coarsening there (thick line in Figure 11.8 and crosses in Figure ii.io[>. 



II.2.2 Equilibrium study 

We can attempt an analogous study with our equilibrium simulations. The role of 
Qink(C^, iw) is now played by ^{Q\mk\'^), where the conditional expectation value 



was defined in ( io.56| >. 

Just as in the non-equilibrium case, we expect Qiink to be a strictly increasing 
function of q^ below q^p^ for an RSB system, and to be constant for a TNT system. 
Now the large-^w limit is of course replaced by a large-L limit and the experimental 
scale is, as always, L ^ 100. 

q = 0.523) for our simulations 
ia8|)p| According to the TNT 



In Table 11.5 we give C^{r = \\q = Q) and C4(r = ' 
at T = 0.7 (where t^EA = 0.523(32), from Section 

picture, these two quantities should scale as L"*^ *** and have the same extrapolation 
to infinite L. As the reader can check, however, the data do not fit the 
behaviour, let alone extrapolate to the same large-L limit. The same conclusion 
holds if we replace L by 

£ = 7r/ sin(7r/L), (11.26) 

more natural for lattice systems. Yet, it could be argued that our data are preasymp- 
totic, so we may try a TNT extrapolation with corrections to scaling 



C^{r = l\q) 



Coo + ^(jL 



-0.44 



(1 + B^L- 



(11.27) 



We have performed a joint fit of the data from Table 11.5 to this equation, using a 



chi-square estimator analogous to that of ( 10.70 1. The fit parameters are the four 



amplitudes Aq, Bq, A0.523/ B0.523/ the common scaling corrections exponent x and the 
common extrapolation Coo- In this case the correlation of the data for q = Q and 
q = 0.523 at the same L is very small — see Table |ii.5| — and, of course, there is no 
correlation between data at different L. The result is 



Coo = 0.677+0;0J2, ^ ^ 0.57^^-11 x^/d.o.f. = 9.1/4. (11.28) 

Notice the highly asymmetric errors. The fit is not a very good one, its T-'-value (cf. 
Appendix |B]) is of only 6 %. 

Therefore, the TNT behaviour is seen to be, at best, very forced with our numer- 
ical data. In the RSB setting, however, we expect C4{r = l\q) to scale as ^ L~^, with 



^Remember that Ci{r = l\q) = E(Qiink|^j)- 
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L 


C(1|0) 


C(lkEA) 


n 


8 


0.46138(82) 


0.57253(33) 


0.134 


12 


0.51649(71) 


0.60390(28) 


0.051 


16 


0.54552(60) 


0.62089(22) 


0.060 


24 


0.57573(77) 


0.63742(17) 


-0.119 


32 


0.59131(94) 


0.64579(24) 


0.063 



Table 11.5: C(r = l\q) for q = and = qEA for all our system sizes at T = 0.703. For each 
L, we include the correlation coefficient between both values of q. 



a (^-dependent infinite volume value Coo(^)- Indeed, if we fit the data in Table 11.5 

to C{l\q) = Coo{cj) + A/i we obtain 

Coo(O) = 0.6349(8), ;\:Vd.o.f. = 3.63/3, (11.29) 

Coo(^?ea) = 0.671 1(2), x^/d.o.f. = 2.86/3. (11.30) 

Using L instead of £, we would have had to restrict the fit to the largest lattices, but 
the results would be similar (if with larger errors), as the reader can easily check. 



11.2.2.1 The variance of the link overlap 

We have seen that Qunk is a strictly increasing function of q'^. However, in order for 
these two variables to be interchangeable, we need a second condition. That is, the 
conditional variance of Qiink at fixed q must vanish in the large-L limit. In this way, 
fixing q would unambiguously fix Qunk also. 

In general, we define the conditional variance of O at fixed q = c as 

Yeir{0\q = c) = E{0^\c) - E{0\c)^. (11.31) 

Notice that 

(02)-(0) =/ dqp{q)[Yar{0\q)+{E{0\q)-{0)Y]. (11.32) 
J —00 



In Figure 11.12 we plot Var(Qiink|i^) for all our lattice sizes at T = 0.703. We 
find a very clear decay of the variance. The decay exponent Var(Qiinkl^) ^ 
is compatible with c = D/2, although we lack a theoretical argument for this 
particular value. 
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0.04 




Figure 11.12: Conditional variance ysiTi{Q\mk\'^) for all our lattice sizes at T = 0.703. We 
rescale the horizontal axis by the qEA{l-), in order to see a better collapse. 



11.3 

The structure of correlations in the spin-glass phase 

In the previous sections we examined two of the most straightforward character- 
istics of the spin-glass phase: the question of (non)-triviality in the spin and link 
overlaps. We interpreted our study as a way of distinguishing between the three 
competing pictures of the spin-glass phase: droplet, TNT and RSB. 

Of the three, RSB fit our finite-size data better and also produced more con- 
sistent extrapolations to the thermodynamical limit (assuming our data are repre- 



sentative of the asymptotic behaviour, a possibility supported by Section 10.3.2 1. In 
any case, even accepting the possibility of a crossover at very large sizes that would 
completely change the behaviour in the thermodynamical limit, the conclusion that 
the RSB picture describes the spin-glass phase at the experimentally relevant scale 
of L ^ 100 seems safe. 

Still, stating that the RSB framework seems more faithful to the physics of the 
Edwards-Anderson model would not end the discussion. Recall that this theory 
was introduced as the solution of the mean-field version of the Edwards- Anderson 
model. In D = 3 its more general points (such as the non-triviality of the spin and 
link overlaps studied thus far) are expected to remain valid, but many fine details 
are certain to change. 

Among these we have the issue of the structure of the correlations in the spin- 



glass phase. In Section 10.8 we said that the connected correlation function of an 
RSB system was expected to decay as C4(r\q'^ < q'^^) — q'^ ^ r~^^'l\ where the 
structure exponent 6{q) is positive for < \q\ < qEA- This clustering property 
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is a consequence of the existence of the replicon, a critical Goldstone mode. In 
particular, the RSB result above the upper critical dimension (D > 6) is |dom98[ 
IDOM99] 

9{q = 0) = D-4:, (11.33a) 
e{0 < \q\ < qEA) = D - 3. (ii-33b) 
e{\q\=qEA) = D-2. (11.33c) 

These results are expected to renormalise in lower dimensions | |domo6 | | . In partic- 
ular, if we want the spin-glass phase to have the clustering property, these values 
are inconsistent in D < 4 (because then the correlation function would grow with 
distance). On the other hand, some authors ICON09] expect the mean-field pre- 
diction for q = q^A riot to change in D = 3 (so the singularity in Fourier space 
remains k~^, as for Goldstone bosons). Notice that the mean-field study predicts 
a discontinuity at q = 0, which may or may not be present for D = 3. A theo- 
retical conjecture ||domo6II , suggests that 6{0) = (D — 2 + //)/2, which, using the 
anomalous dimension from ||haso8b|| , would give 9{0) = 0.313(5) in D = 3. 

Using our numerical data, we can try to estimate 9{q) directly for D = 3 and, 
therefore, increase our knowledge of the RSB spin-glass phase. At the same time, 
a study of the connected correlations provides an additional way of distinguish- 
ing between the RSB and droplet pictures. In particular, a droplet system is not 
expected to have the clustering property. Rather, it expects 

C4{r\q) = qlAfr/rir/L), \q\ < q^AA <^L, (ii-34) 

where fy/y is a direction-dependent scaling function with fr/r{0) = 1- On the other 
hand, for \q\ = q^A/ the connected correlation is expected to vanish in the large-r 
limit, decaying with the stiffness exponent y [ braS/] . We could, then, summarise 
the droplet expectation as 

0{\q\ < ^?ea) = 0, (11-35) 

e{\q\ = qEA) = y- (11-36) 
The exponent 9{q) is actually closely related with the critical exponent 1/v that 



we introduced in Section 10.8 Indeed, we can prove the following hyperscaling 
law 

e{qEA) = 2/v. (11.37) 

In order to do t his, le t us consider a similar scaling argument as we used for the 
DAFF in Section 
Then, 



7.2.1 



We start by adding an interaction hqL^ to the Hamiltonian. 

^{h) - h-"^ = h-^^^'\ (11.38) 
On the other hand, from the decay of the correlation function at the transition point 

CMqEA)-qlA-r-^^'^^'-\ (11-39) 
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combined with the definition { 2.i'y\ , we immediately read off 

e{qEA) = D-2 + fj. (11.40) 
Now, from ( ii.38[ > and ( ii.4o[ > and the hyperscaling relation ( 2.i9d[ > we have 



e{qEA)=2{D-v- 
The field h has the effect of changing q-^A, 

dqEA _ ^D-e(<?EA) 



dh 



so 



[^Ea(^) -^Ea(O)] ^ /^l--4D-e(^EA)) ^ ^il-v„D)/v,^ 

where we have used (11.38) and ( |ii.4i[ >. Finally, by definition of v, 
Comparing these last two equations we get 



1 1 



(11.41) 

(11.42) 
(1143) 
(11.44) 

(1145) 



which, when plugged into ( ii.4i[ >, produces our sought relation (11.37). Notice 
that the combination of this hyperscaling law and our result l/v = 0.39(5) gives 
^('?ea) = 0.78(10). Recall that in Section 10.8 we obtained a very rough estimate of 
^(^ea) ~ 0.65, which is therefore compatible with this scaling law. 

However, both of the above values for 6{qEA) are incompatible with the droplet 
prediction of 9{q'EA) = J/- Remember that the values of y in the literature are 
typically close to y ~ 0.25 [1CAR021 [BO E04 , BOE05]. Furthermore, by forcing our data 
to follow the droplet scaling in Section 11.1.1 we obtained an even lower y ~ 0.12. 



Aside from this exact scaling law, we can formulate some additional conjectures. 
Indeed, assume that 0(0^) = 6{0 < \q\ < ^ea) < ^{qEA)- Then, we can write 



Ci{r\q) - q^ 
On the other hand, 

Ci{r\q) - q^ 



Ml) 

1^(0+) 



+ ..., 



< 1^1 < qEA- 



(11.46) 



B{q) 
L''('?ea) 



+ 



q ^ ^?EA- 



(1147) 



Notice that the decay in (11.461 is slower than that of (11.47I. Therefore, A{qEA 



must be zero. Close to q^A we can consider, then the expansion 



(11.48) 
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If {q — q^A) is of order L one expects both (1147) and (11.48 
therefore 



to hold and. 



e{o < \q\ < qEA) = e{o+) < e{q^A 



e{o+) + ^ = eiqEA) 



(11.49) 



Notice that the combination of this conjecture and the scaling law (11.37 1 gives the 
additional conjecture 

e{o+) = i/v. (11.50) 

In the previous equations, we have written 6{0~^) to accommodate the possibility 
of a discontinuity at q = 0. 

Let us finally note that, at the critical temperature. 



e{0) =l + rj = 0.625(10), 



(11.51) 



where we have used the value of t] from ||haso8b| . 

In this section we shall use our spin-glass simulations to compute the exponent 
6{q) and decide between the RSB and droplet expectations. 



1^.^.^ Non-equilibrium study 

As we have mentioned before, in the RSB scenario, the non-equilibrium spatial 
correlation gives us access to the q = physics (we start with q = Q ait^ = Q and 
remain there, since there are equilibrium states with vanishing order parameter). 
Therefore, one expects the following long-distance behaviour of Ci^{r,t^), 



r— >oo 



1 



,0(0) 



f{r/atv.)). 



(11.52) 



Notice that this is just ( 10.13 ), but now we have reinterpreted the decay exponent a 
as = 9{0). 

The statement that 6{0) = for droplet systems then translates in that the 
spatial correlation at fixed r/^{tw) should not vanish in the large- limit. This is, 
of course, just the behaviour that one expects in a coarsening system. 

Recalling the integrals 4(^w)/ defined in (10.221, and our choice of ^i,2(^w) for 
estimating the coherence length, we have 



2-9(0) 



(11-53) 



We have plotted Ii against ^(^w) in Figure 11.13 ^ = 0.6,0.7,0.8, Tc. We also 
include, as a comparison, the same plot for the D = 2 site-diluted Ising model, a 
system that we know follows the 6{0) =0 coarsening behaviourj^] From this plot we 
see that the value of 9{0) for our subcritical temperatures is clearly different both 



7We could simply have used the ferromagnetic Ising model, but this slightly less trivial model 
serves to illustrate the issue of superuniversality. The simulations are for an L = 4096 lattice with a 



25% dilution, averaged over 20 samples at T = OMT, 



Jsing 
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Figure 11.13: Plot of ^i,2(fw) as a function of /i(fw) for our three subcritical spin-glass 
runs and for our run at T = 1.1 ^ T^. We also include, as a comparison, the analogous 

plot for a D = 2 site-diluted Ising model. We expect a behaviour ^ h, with a 

different 9{0) in each case. The diluted Ising is a coarsening system, so 0(0) =0 and 
^12 1^^. The critical simulation is well represented by 0(0) = 1 + ?/ (full line, taking 
the anomalous dimension from flHASoSBl V On the other hand, the subcritical spin-glass 
simulations show an intermediate value of 0(0), different both from the critical and the 
coarsening predictions. 



r 


samples 


[^min/ ^max] 


z 


0(0) 


4/d.o.f. 


4/d.o.f. 


0.6 


96 


[3,10] 


14.06(25) 


0.359(13) 


41.7/82 


49.0/82 


0.7 


768 


[4,10] 


11.64(15) 


0.397(12) 


40.1/58 


60.4/58 


0.8 


96 


[3,10] 


9.42(15) 


0.442(11) 


17.1/63 


12.2/63 


1.1 


32 


[3,10] 


6.86(16) 


0.585(12) 


18.7/46 


26.1/46 



Table 11.6: Computation of the replicon exponent 0(0) with non-equilibrium methods. For 
each temperature, we include the range of ^1^2 in which we computed the separate fits for 
Ji and ^1^2 arid the Xd'^^-^-^- each fit. Notice that the fits for ^1^2 are the same ones 
reported in Table 10.1 
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from the coarsening behaviour and from the critical value. On the other hand, the 
curve at T = Tc seems to follow the expected 6{0) = l + tj behaviour. Notice also 
that 0(0) is at least very similar for all the subcritical temperatures, in accordance 
with our theoretical expectation (but in clear contrast with the behaviour of the 
dynamic critical exponent z, which was inversely proportional to T). 



In order to compute the actual value of 0(0) we could in principle fit li to (11.53 * 
as a function of ^12- Notice that these variables are highly correlated, which should 
reduce the statistical errors in 0(0). However, we would face the complicated prob- 
lem of fitting strongly correlated data to a functional form y = f{x) with errors in 
both the X and y coordinates. We faced quite the same difficulty when estimating 
the exponent d measuring the decay of the thermoremanent magnetisation in Sec 
tion 



law. 



10.4 and we can adopt the same solution. In particular, we fit Ii(iw) to a power 

Jl(iw) 



Bt 



and recall that ^1,2 (^w) = ^^w^/ Eq. (10.26 1. Therefore 



0(0) 



cz. 



(11-54) 



(11-55) 



a relation that we can apply for each jackknife block. 

The results of following this procedure are quoted in Table 11.6 ^ As we can 
see, the values of 0(0) below Tc are not actually compatible. However, the value at 
T = 0.8 is probably affected by critical effects, while the values at T = 0.6,0.7 are 



actually very close (see also Section B.3.3.1 for some technical issues with these fits). 
Therefore, we use the interval between the computations at T = 0.6 and T = 0.7 as 
our confidence interval for the replicon exponent. 



0(0) = 0.38(2) 



(11.56) 



This value is in good agreement with a previous (but much less precise) ground- 



state computation giving 0(0) 
we have 



0.4 at r = [ iMAROii . On the other hand, at T = Tc 



0(0) = 0.585(12), T = Tc. (11.57) 

This value is a couple of standard deviations away from the estimate we quoted 



in (11.51). This difference is at the limit of statistical significance and could be due 
either to corrections to scaling or to a small cutoff error in our computations of 



the coherence length and Ii (we have fewer samples at Tc, see Section B.3.3.1 for a 
discussion of this issue). 



11.3.2 Equilibrium study 

Let us now examine the scaling of the equilibrium connected correlations. We 
shall first carry out a study in real space, using the previously computed 0(0) as 



°For T = 0.7 we use our simulations with 768 samples. See Section 103^ for our choice for the 
fitting range. 
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Figure 11.14: Spatial correlation function C4^{r\q = 0) at T = 0.703. We show on the right 
panel a rescaled version using the replicon exponent 9{0) = 0.38 and the scaling variable 
r/L. 



a starting point. Then, as in Section 10.8 we change into Fourier space, more 



convenient for a fully quantitative analysis. 
11.3.2.1 Real space 

We first concentrate on q = 0, the region where the droplet and RSB prediction are 



most different. Here, see Figure 11.14 the correlation function is seen to go to zero 
for large distances. Furthermore, in order for the droplet scaling as r/L of ( |ii.34[ ) to 
work, we need to rescale our data by a factor L^^^\ using our previously computed 
value of 0(0) = 0.38(2). 

For \q\ > 0, in principle we have to compute explicitly the connected correlation 
Ciirlq) — q^. This subtraction is problematic for finite lattices |jcoN09|, so instead 
we take care of the large-r background by considering the differences 

C4(r = L/^q) - C4(r = L/2\q) - L'^^'?), (11.58) 



We see in Figure |ii.i5 that the above differences scale with L"^^^) in a finite q 
range (approximately for q^ < 0.2). This is a new piece of evidence in favour of the 
clustering property, that is, of the algebraic decay of connected correlations. 

On the other hand, for q^ = q^^^ ^ 0.3 the exponent is clearly larger than 6{0). 
The fact that the scaling with 9{0) holds for a finite range and that, in particular, 
there seems to be no discontinuity at q = suggests the following scenario 

0(0 < \q\ < qEA) = 0(0) < 0(^?ea)- (ii-59) 



Therefore, we are in the conditions for the scaling law ( 11.491 to hold and we should 
have 6{0) = 1/v. In fact, our results are 

e{0) = 0.38(2), (11.60) 
1/v = 0.39(5), (11.61) 
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Figure 11.15: Subtracted correlation functions in units of L ^('^^ as a function of q. We used 
0(0) = 0.38(2) from ([1Z56). 



compatible with this expectation. We still lack, however, one final piece in the 
puzzle: the actual value of 6{q^A)- We shall attempt to compute it in Fourier space 
in the next section. 

Let us conclude the real-space study by examining the conditional variance 
of the correlation function. We shall write Var(C4(r) in an abuse of language 
(C4 is already defined as an averaged quantity, not a random variable) that we 
hope will create no confusion. Notice that, while ^{Qimkl^) = C^i^ = l\q) the 
variance Var(Qiink|'?) is different than Var(C4(l)|^J). These variances are plotted in 
Figure 11.16 where they are seen to decrease with L even faster than Var(Qiinkl^)- 



11.3.2.2 Fourier space 

In the previous section we saw that the equilibrium spatial correlations scaled well 
at q = with the 9{0) computed out of equilibrium and that, indeed, this same 
exponent seemed to rule the scaling for a finite q range. 

Here we want to make a more quantitative study, using the correlations in 
Fourier space (more convenient at q ^ 0, since they do not require a subtraction). 
In particular, we shall obtain independent estimates of 6{0) and 6{q^j^) and test the 
hypothesis of constant 6{q) in the range < q < qEA- 



To this end, let us recall definition (10.65) ^^id perform fits to 



Fa 



where (. was defined in (11.26 1. We have done this for q = Q and q 
T = 0.703 in Table 11.7 The scaling at = is very good and we obtain 



e{0) = 0.377(14), 



(11.62) 

q-EA at 

(11.63) 
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0.5 




Figure 11.16: Conditional variance of the spatial correlation functions at T = 0.703, which 
is seen to scale as ~ L^^-^^ , faster than the L^^-^ scaling of Var(Qiink|'?)- 



in excellent agreement with our non-equilibrium estimate (11.56 1. For q = q-^p^, 
however, we need to restrict our fitting range to L > 16. Furthermore, the uncer- 
tainty in ^EA = 0.52(3) induces a systematic error, which we denote with square 
brackets. Our final estimate is 



e(^?EA) =0.6n(16)[60]. 



(11.64) 



This estimate is compatible with the scaling law ^(i^EA) = 2/v — recall that 1/v = 
0.39(5). 

For the sake of completeness, and as a check of our procedure, we have also 
computed 6{0) at Tc- Our result, 6{Q) = 0.638(4), is very different from the value 
at T = 0.703 and compatible with the best result in the literature ||haso8b|| , 0(0) = 
0.625(10) (our error bar is deceptively small, because we have not taken scaling 
corrections into account, unlike BhasoSbII ). 

Finally, let us test the hypothesis of constant d{q) for \q\ < qEA- To this end, we 



show in Figure 11.17 the analogous plot to Figure 11.15 i^i Fourier space. Not only 
do we see a good scaling with 9{0) for a finite range, as in Figure 



11.15 



but now 



in our Fourier-space representation each of the curves is linear in below 
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L 


T = 1.109 




r = 0.703 




Pa 


Pa 


p 


8 


16.126(64) 


19.46(18) 


10.373(48) 


12 


40.59(18) 


53.65(62) 


25.79(15) 


16 


79.07(31) 


112.5(13) 


51.60(29) 


24 


204.05(83) 


327.3(43) 


134.54(80) 


32 


404.4(29) 


699(23) 


267.9(30) 


f min 


8 


8 


16 


;;tVdof 


0.90/3 


1.38/3 


0.13/1 


e 


0.6382(44) 


0.377(14) 


0.611(16)[60] 



Table 11.7: Fq^Q for our different system sizes and temperatures T = 1.109 ~ and T = 
0.703. We report at the bottom power-law fits = Ai^-^'^i^ in the range L > Lmm- At 
T = 0.703 we also consider Fq at cj = 0.523 ~ qEA- The second error bar, in square brackets, 
accounts for the uncertainty induced by the determination of q^A (a larger q^A produces a 
larger 6). 




Figure 11.17: Plot of Fg at T = 0.703 in units of ^^'-0(0) ^^le real-space representation 

of Figure 11.15 see a good scaling for a finite range in q. The vertical line marks 

our estimate of q^p^. 
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L 




8 


35.6(18) 


12 


110.4(58) 


16 


254(12) 


24 


816(40) 


32 


1823(216) 



Table 11.8: Coefficient ^2 in a fit to (11.65 ' for for 'J < 0-5 at T = 0.703. 



Therefore, we have performed the following fit 

Fq- Fo = -a2q^ + a^q'^. (11-65) 

Notice that, since the data at different q are correlated, subtracting the q = value 
yields a significant error reduction. The quadratic term is only included to control 
systematic effects in the computation of ^2 (and it probably causes us to overesti- 



mate errors). We report the resulting vales of ^2 in Table 11.8 



Now, in the case of discontinuous 6{q), one would expect a behaviour 

a2{L) = bf, (11.66) 



with c > D — 6{0). If we perform this fit from the data in Table 11.8 we obtain 
c = 2.91(6), with ;\;^/d.o.f. = 0.08/3. Yet, the extremely small is an indication 
that we have probably overestimated the errors in fl2- Therefore, let us take the 
value of c seriously and consider the scaling of f^^ with q^, = 0.1 < q^A- Then, in 
order for the would-be different scaling atq^, to be noticeable, we need a lattice size 
large enough so that 

Fq-Fcj^ a2ql Fq. (11-67) 

The reader can check that this results in an enormous L ^ 3 x 10^. Therefore, we 
do not find any numerical evidence in favour of the perturbative prediction of a 
discontinuity at q = 0. 

Let us go back our favoured scenario of constant 6{q) in \q\ < q-^A- Then, the 
linear behaviour of Fq with q^ suggests a scaling 

fl2(L) = e£°-^(0)(l - d£^(o)-0(<jEA)). (11.68) 

In addition, in this hypothesis of constant 6{q) we have 6{0) = 1/v = 6{q-EA)/'2- 
from (11.37) and ( ii.49| >. Therefore, we can write 

fl2(L)=^'£°-^(°)(l-dr^(0)). (11.69) 



Fitting the data in Table 11.8 to this expression, fixing 9{0) = 0.38(2) and varying 
only the amplitudes, we obtain an excellent ;^^/d.o.f. = 0.35/3. Therefore, the 
simplest scenario consistent with the data is 

0{\q\ < qEA) = 0{0) = 1/v = 0.38(2), (11.70) 
0(^ea) = 20(0). (11.71) 



CHAPTER XII 



Conclusions and outlook 



In this dissertation we have explored two main themes in the context of the statis- 
tical mechanics of disordered systems, 

1. It is worthwhile (and feasible) to consider a statistical ensemble tailored to 
the problem at hand, both from the point of view of achieving thermalisation 
and in order to obtain the maximum amount of physical information about 
the system. 

2. When working with disordered systems, one cannot help dealing with non- 
equilibrium phenomena. In this sense, a quantitative relation between the 
equilibrium phase and the non-equilibrium evolution can, and should, be 
established. 

We took a Monte Carlo simulation approach to both issues, carrying out simula- 
tions in a variety of computing systems (conventional computing clusters, super- 
computing facilities and grid resources) for a combined total of several millions of 
CPU hours. In addition, we analysed the data from about one year of non-stop 
production of Janus, a special-purpose machine equivalent to many thousands of 
conventional computers. 

In regards to the first point we introduced the Tethered Monte Carlo method, 
a general formalism to reconstruct the Hehnholtz effective potential associated to 
an arbitrary reaction coordinate. The method consists in performing individual 
Monte Carlo simulations at fixed values of the reaction coordinate, which are then 
combined to construct the Helmholtz potential from a fluctuation-dissipation for- 
malism. By choosing an appropriate reaction coordinate (i.e., one that labels the 
different relevant metastable states) it is possible to use this method to obtain a 
comprehensive picture of the system's physics. 

We demonstrated that this approach is indeed workable in physically relevant 
situations by tackling a classic 'hard' problem in the field of disordered systems: 
the phase transition of the diluted antiferromagnet in a field. We found that the 
tethered approach unlocked much information that remains hidden to a traditional 
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method and we were able to obtain a comprehensive and consistent picture of the 
critical behaviour in this model. 

We explored our second point, the statics-dynamics relation, in the field of spin 
glasses. There we analysed large-scale simulations performed on Janus. These 
were carried out both in and out of equilibrium and in both cases constituted a 
jump of several orders of magnitude with respect to the state of the art. We were 
able to use these high-quality data, combined with some novel analysis techniques, 
to establish a quantitative statics-dynamics equivalence and to determine the nature 
of the spin-glass phase up to the experimental scale. 

In the following subsections we detail our conclusions and outlook for each of 
our three lines: Tethered Monte Carlo, the DAFF and the Edwards-Anderson spin 
glass. 

12.1 

The Tethered Monte Carlo method 

We started by demonstrating our Tethered Monte Carlo method in the context 
of Ising ferromagnets. In this comparatively simple problem we found that our 
formalism, coupled to a local update algorithm, was able to reconstruct the effective 
potential without critical slowing down. We then showed that the method can be 
combined with sophisticated update schemes (cluster algorithms), in which case 
the critical slowing down disappeared even for non-magnetic observables. 

We then applied Tethered Monte Carlo to a system with a rugged free-energy 
landscape: the DAFF (see below for the physical results). We found that the method 
was able to eliminate the exponential slowing down caused by the free-energy 
barriers associated to our reaction coordinates (the magnetisation and staggered 
magnetisation). 

Thus, we were able to thermalise safely much larger systems that are possible 
with conventional methods, such as parallel tempering in the canonical ensemble. 
However, for our largest systems we started to see metastable behaviour caused by 
new free-energy barriers. In this sense, the onset of exponential slowing down was 
delayed, rather than outright eliminated. 

Dealing with these new free-energy barriers will probably require the intro- 
duction of additional reaction coordinates, capable of a further classification of 
metastable states. Our method is in principle equipped to carry out this plan, but 
a practical implementation requires some work. 

A different avenue for further work with Tethered Monte Carlo is extending 
its application to new systems. One interesting possibility is the study of the con- 
densation transition liBiso2i |bino3[ , |MAC04} Inusso61 InussoSL which can be modelled 
with Ising systems. We finally note that the tethered formalism has already been 
applied to the study of hard-spheres crystallisation [ iferiiaI . 
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12.2 

The diluted antiferromagnet in a field 

The use of the tethered formalism allowed us to obtain a consistent picture of 
the critical behaviour in the DAFF. We obtained clear evidence in favour of the 
second-order nature of the transition and computed the three independent critical 
exponents. In particular, we obtained a precise determination, Q — 1.469(20), for 
the elusive hyperscaling violations exponent. We also observed a clear divergence 
of the specific heat, in accordance with experimental work. 

Still, there is much work still to do. In particular, our data for the v critical 
exponent was affected by finite-size effects. Also, even if we determined that the 
specific-heat exponent a is positive, we could not estimate its precise value very 
precisely. To this end, the simulation of even larger systems would be required. 

Finally, it would be very interesting to achieve a closer connection to experi- 
mental work. The obvious contact point would be obtaining a precise numerical 
determination of the scattering line shape, which is the basic quantity explored in 
experiments to characterise the critical behaviour. Our methods are well suited to 
carry out this programme in the future. 

12.3 

The Edwards- Anderson spin glass 

We studied the D = 3 Edwards- Anderson spin glass in and out of equilibrium 
and were able to establish a time-length dictionary, relating the equilibrium phase 
of a system of size L and the non-equilibrium state at time of a system in the 
thermodynamical limit. In particular, we reached the conclusion that the equilib- 
rium phase of a system of size L ^ 100, and not the thermodynamical limit, is the 
relevant one for understanding experimental work on spin glasses. 

We later took this quantitative statics-dynamics connection one step further by 
establishing a finite-time scaling framework. This last result, in which dynamical 
heterogeneity played a major role, has implications for experimental work. 

The above studies were carried out in a neutral formalism, without assuming 
the validity of any particular theory for the spin-glass phase. A more detailed 
analysis of the spin-glass phase showed clear evidence in favour of the RSB picture, 
at least for experimentally relevant scales. We based this conclusion on three main 
observations: (i) non-triviality of the spin overlap, (11) overlap equivalence and (iii) 
the existence of a non-zero replicon exponent. 

When trying to decide which is the relevant theory in the thermodynamical 
limit (equivalent to the limit of infinite experimental time), we found that our data 
still supported the RSB picture, but here our results are not as conclusive (in prin- 
ciple, there can always appear a change of regime for much larger sizes or much 
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longer times, even if we see no trace of it with our data). 

There are several promising avenues for future work in this field. First, deciding 
in favour of RSB for the nature of the spin-glass phase does not automatically grant 
one full knowledge of its characteristics. Much work is needed in order to work 
out some very important details, such as the nature of temperature chaos. 

Perhaps the most interesting prospect for future work is trying to reproduce, 
as realistically as possible, an actual spin-glass experiment. Our non-equilibrium 
simulations only considered the simplest protocol of isothermal aging, but many 
interesting effects (memory, rejuvenation, etc.) can only be observed by varying 
the temperature. In this sense, our current simulations were limited by the onset of 
finite-size effects. Indeed, for temperatures close to the critical point, our simulated 
size L = 80 is not large enough to be representative of the non-equilibrium physics 
for experimentally relevant times, since the coherence length grows too quickly. 
Let us stress that although this effect is restricted to the vicinity of the critical 
temperature, any experimental cooling protocol should expend a time close to Tc 
which is very long in the microscopic scalej^ 

Simulating even larger lattices is, therefore, a must if one wants to reproduce 
sophisticated experimental protocols. We are very optimistic in this sense: we 
are already testing a code that allows us to simulate lattices of size L = 256 and 
we also have funding for Janus II, which will not only allow us to simulate even 
larger systems, but also to probe longer times, taking us fully into the experimental 
regime. 



^Recall that 1 MCS is roughly equivalent to one picosecond and that our L = 80 simulations at 
T = Tc showed finite-size effects from about 10^ MCS. 



APPENDIX A 



Thermalisation in Monte Carlo simulations 



This appendix is intended as a reference on thermalisation in Monte Carlo simula- 
tions, providing some definitions that are used throughout this thesis. Most of the 
definitions are standard, but subsection A.3.1 contains some methods developed in 
part during this thesis. 



A.i 

Markov chain Monte Carlo 

Let us begin by recalling some definitions and results relevant to the theory of 
dynamic Monte Carlo methods (this is not meant to be a rigorous or self-contained 
account, just a quick reference, cf. jsoK97}[RUB07] ). In general, we are interested in 
extracting configurations from some probability distribution p{{sx}), often of the 
kind p{{sx}) ^ e"^^''^^^^^. In order to simplify the notation, we denote the system 
configuration by a single random variable X and our target probability distribution 
hYP, = P{X = x). 

Then we define our Monte Carlo method as a random walk in configuration 
space, that is, a sequence Xt of random variables. The first one — the starting 

configuration — will follow some simple probability distribution Px^^ (completely 
random spins, for instance). We need the random walk to reach a stationary state 

such that limi^.00 Px "^ = Px- 

We consider a Markov chain: the probability distribution of X^+i depends only 
on Xt and not on the whole previous history. Then, we can specify the whole 

process by giving the initial distribution Px^^ and a transition matrix Uxy- The 
latter is defined as 

Uxy = V{Xt+i=y\Xt = x), (A.i) 
that is, the probability that the system will be in the configuration y at time i + 1 if 
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it was at x at time t. Of course, we can iterate this definition 

[U%y = V{Xt+n = y\Xt = X). (A.2) 

The objective is that, no matter the initial distribution Pi^\ after a sufficiently long 
number of iterations we obtain configurations distributed according to Px- In other 
words 

lim [W]xy = Px. (A.3) 
We need two ingredients for this 

• First, we need our method to be capable of exploring the whole configuration 
space. To this end, we require the transition matrix to be irreducible: for all 
x,\j there exists an n such that > 0. 

• Naively, we could think of requiring that, once we have reached the stationary 
distribution Pj, the Markov process should lose all sense of direction and be 
reversible. We should not be able to tell if it is running backwards or forwards. 
That is 

PxUxy = PyUyx. (A.4) 

This is called the detailed balance condition. Actually, it turns out that this is 
too restrictive. We only need that the stationary distribution of the Markov 
chain correspond to Px, that is 

X 

This is the balance condition. 



If Uxy satisfies these two conditions, then (A.3 1 can be proved and we have a le- 
gitimate Monte Carlo method. Notice that the balance condition is easier to check 
than it seems, since it does not depend on the normalisation of the Px (typically 
given by the unknown partition function Z). 

Perhaps the easiest example of Monte Carlo dynamics is the heat bath. Let 
us consider the update of a single Ising spin at site x, Sx — ^ s'^, where all the 
remaining spins in the lattice are kept fixed. Then, we consider the following 
update probability 

V{X,^r = {s'}\X, = {s}) = ^^^^_^^^.pEiis"}y (^-6) 

Notice that all the terms in the energy that do not involve site x factor out. There- 
fore, it is obvious that the update of a single spin satisfies the detailed balance 
condition. In order to define a full Monte Carlo method we need to update all the 
spins. We can do this by randomly choosing one spin each time and defining each 
Monte Carlo step as N spin updates (so that each spin in the lattice is updated 
once, on average). In this case the whole transition matrix would also satisfy de- 
tailed balance. We can also consider a scheme where we run through the lattice 
sequentially, visiting all the sites in order. In this case, the process would clearly 
not be reversible, and the transition matrix would only satisfy balance. 
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A.2 

The autocorrelation times 

As a general rule, the thermalisation of a Monte Carlo simulation (i.e., how long it 
takes to reach the stationary distribution) should be assessed through the temporal 
autocorrelation functions ISOK97]. Let O be an observable, defined as always as a 
real-valued function on the configuration space, and assume we have equilibrated 
the system for a very long time, so we have reached the stationary regime. Then, 
we consider the equilibrium (stationary) evolution 0{t) and define the equilibrium 
autocorrelation functions 

Co(0 = ([0(0) - (O)][O(0 - (0)]>, poii) = ^ . (A.7) 

The angle brackets here denote any kind of thermal average (e.g., canonical or 
tethered). The autocorrelation function decays as an exponential for long times, 

Po^f) Aexp[-i/T]. (A.8) 

This asymptotic behaviour suggests the definition of the exponential autocorrela- 
tion time. 

Wo = lim sup — \ — —-, (A.9) 

^' t^oo -log||Oo(i)l 

Texp = sup Texp,0- (A-IO) 
O 

In general, we express the autocorrelation as a sum of exponential^ 

poit) = Y^A,e-'/^K (A.11) 

i 

The exponential time, then, is the largest of the t, and characterises the relaxation 
time of a certain observable. Actually, barring symmetry considerations, the expo- 
nential time is the same for all observables. 

Another useful concept is that of integrated autocorrelation time 

-j^ 00 

nnt,0 = ^+J^Po{t), (a. 12) 

^ t=l 

The integrated time indicates the minimum time difference so that two measure- 
ments of some observable O can be considered independent (i.e., uncorrelated). In 



^Actually, this is only strictly true if the dynamics fulfils detailed balance and is not only irre- 
ducible but aperiodic. Otherwise, there can be some modes in the form of damped oscillations. 
However, these are never observed, within errors, in the applications we consider in this disserta- 
tion. 
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particular, if Var(O) is the variance of a single measurement of O (after we have 
reached equilibrium) and we average over M successive such measurements, the 
resulting variance is 

Var(Ol-^') = > (A.I3) 

A//(2Tint,o) 

as opposed to the value Var(0)/A/' that we would obtain for M statistically inde- 
pendent measurements. 

The integrated time is less useful as a measure of thermalisation than the expo- 
nential. In fact, notwithstanding the previous discussion, the determination of Tijit,o 
is not even always needed to estimate statistical errors (see Appendix |B]). It has, 
however, the considerable advantage of being much easier to measure safely. No- 
tice as well that, if the decomposition (A.ii) contains a single exponential, Texp,o = 
Tint,0/ This approximation can often be employed. Alternatively, if we do not take 
measurements for every single MCS or, better, if we make bins of consecutive mea- 
surements, we can neutralise the faster modes in (a.ii) and approach the single- 
exponential limit. By 'binning' we understand averaging groups of n consecutive 
measurements. In this case, all the components with t, <C n dissappear. The same 
approach is taken in a different context, that of random variables strongly corre- 
lated in space, close to a critical point (Kadanoff- Wilson blocks). 



A.2.I Some computational recipes 

For a large-scale MC simulation, where we have to handle very long time series 
(millions of time steps even after the binning procedure mentioned above), the 
computati on o f po(0 tie time-consuming. Indeed, a naive implementation of 



definition (A.7I requires 0{J\f^) operations. 

We can do better. For simplicity, and without loss of generality, let us consider 
an observable such that (O) = 0, for which we take Af measurements 0{t), t = 
0, . . . , A/" — 1. We also assume all this measurements have already been taken in the 
equilibrium regime (i.e., we have discarded a sufficient number of measurements 



for thermalisation). Then we could estimate Cq a^ 

[Co(0] = ;^'^E 'o{s)o{s + t). (A.14) 



s=0 



Let us define 
and 



Soit) = {Af - t)Co{t) (A.15) 



^ ^ ^ 0, M <t<2N, ^ ' 



and we consider 0'{t) to be extended periodically. 



^We use the notation [O] for the numerical estimator of some thermal average (O). 
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Then we can write our numerical estimate for the unnormalised correlation of 
a as 

[So'(0]= E C)'(s)O'(s + (A.17) 

s=0 

Notice that 

\So\t)\ = [So{t)]. (A.18) 
Now consider the Fourier transform of So'{t), 

2A/'-12A/'-l 

[So'm= E E O'(s)O'(s + 0e-'?^ (A.19) 

t=0 s=0 

Notice that, since we are summing t over a whole period of the function, we can 
change s + t ^ t' in the second sum 

2J\f-l 2Af-l 

[So^m = E 0'{s) E O'ity^'e-^'^^' (A.20) 

s=0 t'=0 

/2A/'-l \ \ 

= E 0'(s)e^^M E O'(0e-^^n = |0(^)|2. (a.2i) 

V s=o /V f'=o / 

That is, the Fourier transform of the autocorrelation is equal to the modulus of the 
Fourier transform of O' (we have needed a periodic O' to get this result). This is 
the Wiener-Khtnchtn theorem. 

Therefore, we can obtain SQ/(i) = So{t) = (A/" — t)Co{t) by computing the 
Fourier transform of 0{t), evaluating its complex modulus and then computing 
the inverse Fourier transform of |0(i^) p. It may seem that we have gained nothing. 
However, the computation of discrete Fourier transforms, seemingly an 0{N"^) 
task, can actually be done in (A/" log A/") operations using the Fast Fourier Trans- 
form algorithm (see, e.g., |jPRE92|). In the work reported herein we have always 
followed this method to evaluate temporal autocorrelation functions. We use the 
FFTW implementation reported in [FR105]. 

Notice that an analogous method can be followed to evaluate spatial correla- 
tions, as we will have ample occasion to do (especially in Part [rv] >. In this case, 
even if the functions are three-dimensional, the procedure is even simpler, as the 
periodic boundary conditions are already built in. Therefore, we do not have to 
pad our observables with zeros, as in ( |a.i6[ >. 

Once we have computed the |Oo(0 need to estimate the autocorrelation 
times. For the exponential ones our only recourse is a fit to one or more expo- 
nential modes (see below). The integrated time must, in principle, be computed 
from the sum of our numerical estimate [|Oo(0] extended to all our values of t. 
However, as is clear from our definition ( |a.14[ >, the final times have much smaller 
statistics, so their contribution is nothing but a white noise. Therefore, Tint,o is best 
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evaluated using a self-consistent window jsoKgyj l 

1 ^ 

Tint,0 = 2 + E[^o(0]' (a.22) 



t=l 



where A is chosen as A = WTjnt^O/ with W a tunable parameter (W = 6 works 
well). 

A.3 

Thermalisation in disordered systems 

The safe computation of the autocorrelation times described in the previous sec- 
tion requires a simulation time orders of magnitude larger than Tgxp- This is not a 
problem when simulating ordered systems, because extending the simulation re- 
duces the statistical error of the final results. With disordered systems, however, 
the main source of statistical error is the sample-to-sample fluctuation. Reducing 
the thermal error for each sample beyond a certain, easily reachable, threshold is 
useless. 

Therefore, given a fixed computational budget, we would like to simulate each 
sample for the shortest possible time that ensures thermalisation|3] Since so short 
a run time does not permit a computation of the Tq for physical observables, re- 
searchers have traditionally employed less rigorous methods to check thermalisa- 
tion. 

In general, the typical method is to study the time evolution of the disorder- 
averaged physical observables. In particular, one of the most widespread practical 
recipes is the so-called log2-binning. One divides the simulation time M in loga- 
rithmic intervals X„: 

Xn = (2-«+W, 2-«AA], (A.23) 

Notice that with this definition time decreases with increasing n, so Xq corresponds 
to the second half of the run, Ii to the second quarter, etc. Then, one evaluates 
the thermal average for each sample and the disorder average, for each of the time 
intervals. If the last few Xn show no evolution, the simulations are considered to 
be thermalised. 

This procedure is not optimal, because in many situations the thermalisation 



time is wildly dependent on the sample (see Figures 8.5 and 8.6 on page 123 for 



the DAFF or Figures e.i and |e.2 on page 287 for the Edwards- Anderson spin 



glass). Thus, if we choose a simulation time that would thermalise even the slowest 
samples, we will spend most of our time extending already well thermalised runs 
for no benefit. Worse, a stationary average over many samples may hide the fact 



3 Notice that this not only would allow us to maximise the number of samples we can average 
for a fixed total CPU time, but, assuming we can run many samples at the same time, would also 
minimise the wall-clock time (the physical time we have to wait for the results). 
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that a few samples are still quite far from equilibrium (the thermalisation time can 
vary along several orders of magnitude from one sample to another). 

One could toughen the log2-binning test by taking the correlations into ac- 
count jFER07j . Indeed, since the samples considered for each T„ are always the 
same, the difference between the resulting averages (even assuming complete tem- 
poral decorrelation) will typically be much smaller than their statistical errors. This 
is because, as we said earlier, the thermal fluctuations account for an almost negligi- 
ble part of the total statistical error. We can take this into account by computing for 
each sample the differences between the average for Iq and each of the X„ and con- 
sidering the evolution of these correlated differences, which have a much smaller 



error (see Figure 8.9 1. To be sure, this is a more stringent test, but ultimately an 
unphysical one (because it considers much smaller errors than those of the physical 
results). Furthermore, it does not address the important problem of efficiency. 

In the next section we shall see how we can provide a more rigorous method, 
taking advantage of the parallel tempering algorithm used to accelerate the ther- 
malisation. 

A.3.1 Parallel tempering 



Parallel tempering | |huk96[ |mar98[ is one of the most common methods to accel- 



erate thermalisation. Its motivation is the rugged free-energy landscape picture 
described in the General Introduction. Let us consider a system described in the 
canonical ensemble by the following partition function 

m = E e"^'- (^-24) 

{Sx} 

At a physically interesting low temperature the system is trapped in deep free- 
energy valleys, with extremely long escape times. At a higher temperature, how- 
ever, the energy fluctuations are enough to overcome these barriers quickly. 

We can take advantage of this by simulating A/j copies of the system (in our 
case, Mt starting configurations with the same disorder realisation). Each copy 
starts the simulation at a different temperature, with Ti < T2 < . . . < Tj</^. We 
assume that Ti is such that our chosen Monte Carlo dynamics would need an 
impossibly long time to thermalise the system, while Tj\f^ is high enough for the 
same dynamics to achieve thermalisation very quickly. After updating each copy 
independently for one or more MCS we perform the parallel tempering update. 

Then, we consider a configuration space with AfjiV + 1) degrees of freedom, 

{T^^\ {sx^}}. The temperature associated to copy i of the system can change, but 
the ensemble of temperatures {j6'^'^} must always be a permutation of 

. . . ,/3^J = {1/Ti, . . . , l/r^J. (A.25) 

Now, the partition function of the whole ensemble of systems is 
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We can compute the marginal distributions for the Summing over the spins, we 
find that the marginal probability density of the temperature configuration {/3*^')} 
is uniform, 

p({/3«}) = 1/(AAt!). (A.27) 
Moreover, the marginal probability for a single temperature is also uniform, 

p(i6W) = l/Mj. (A.28) 



The partition function (A.26) allows us to consider a Monte Carlo dynamics 
where we simply try to exchange copies i and / of the system, initially at neigh- 
bouring temperatures T]^ and T/^-^i, 



{r = hds):'}}x{^'^^ = h+v{s)i'}} 



r\ r^ (A.29) 

> {/3^''^ = /3)c+i,{s?}}x{/30)=/3,,{sW}}. 



The Metropolis acceptance probability (see Section 3.2) for this dynamics is 



V = min{l,a;new/a;oid} = min{l,exp[-(j6fc+i - ^k){E^^^ - E^'^)] }• (A.30) 

In order for this dynamics to be irreducible, a full parallel temperature update 
consists in attempting one exchange per temperature, going sequentially from the 
lowest to the highest (so one configuration can actually jump several temperatures 
in a single update). 

This way, the temperature of each copy performs a random walk in temperature 
space. Once the system reaches a high enough temperature, a few MCS are enough 
to make it cross the free-energy barriers so when it eventually returns to its original 
temperature it has been decorrelated from the starting configuration. 

Notice that for this algorithm to work, we need to choose the participating tem- 
peratures close enough for their energy histograms to overlap, so the acceptance 
probability ( a.3o[ > is high enough. 



A.3.2 The temperature random walk 

In equilibrium, each copy should spend the same time at each temperature (re- 



call (A.27)), so studying the temperature random walk can give information about 
the thermalisation. In particular, several round trips from lowest to highest tem- 
perature are needed for a system to be considered thermalised. 

We can promote this idea to a fully quantitative and physically meaningful 
leveig Let us assume that our range of temperatures crosses the critical point and 
let us define ic such that 

T,^_i < Tc < T,^. (A.31) 
Now, for one of the N't copies of the system, we consider its temperature random 



walk mapped to the range of the temperature index i{t) G {!,... , A/V}. Figure A.i 



^The method described herein was first introduced in [ jFERogBj and we later refined it in [|janioa| . 
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10^^ 2 ■ 10^^ 



t 

Figure a.i: Temperature random walk of a single configuration in one of our spin-glass 
simulations of Appendix [Ej We plot the temperature index i{t) against the time measured 
in heat bath updates. The critical temperature corresponds to ic = 17 (in the middle of the 
range). 



shows one example, taken from our spin-glass simulations. Notice that the random 
walk is clearly not Markovian, as evinced by the long plateaux, since the system 
retains for a long time a memory that it started in the high (or low) temperature 
phase. 

Our goal is to construct a quantity that is both representative of the random 
walk and amenable to the computation of autocorrelation times described in the 
previous section. To this end, we consider a mapping f{i) such that 

f{i) > 0, > ic, (A.32) 

f{i) < 0, Vz < ic, (A.33) 

E/(0=0. (A.34) 

i=\ 



Recalling (A.28), we see that the last condition is just (/) = 0. It is also convenient 
that / be monotonic. Generally, one chooses the temperatures so that ic is in the 
middle of the range, so a simple linear / suffices. 



Now we can compute the autocorrelation Cf{t) with Eq. (a. 14 1. Notice that / 
has an advantage over the Cq of the physical observables: we can average the cor- 
relations over the Mj copies in the parallel tempering. These are not, of course, 
completely independent (the most obvious constraint being that each must be at a 
different temperature), but still the averaged correlation has a much smaller vari- 
ance. Furthermore, in the case of spin glasses one simulates several statistically 
independent replicas, which provide an additional averaging layer and allow an 
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estimate of the statistical error. 



As explained in Section A.2 the thermalisation of the system is characterised 
by the exponential autocorrelation time. Recall that, barring symmetry consider- 
ations, the exponential time should be the same for all random variables in the 
simulation, including the physical observables. In general, the exponential auto- 
correlation time must be computed from a fit to an equation such as ( a.ii[ >, with 
an appropriate number of exponential modes. This is not a problem for ordered 
systems simulations, where one can simply select the fitting range and functional 
form manually. When simulating disordered systems, however, one would have to 
do thousands of such fits, each with a different fitting range (in our study of the 
DAFF, for instance, we performed several hundred thousands independent parallel 
tempering simulations). 

Clearly, an automated system must be found. The first step is making the 
Cj:{t) as simple as possible. In this respect we notice that the choice of / should 
not modify the exponential autocorrelation time, but will affect the relative sizes 
of the Ai. Our criteria ( a.32[ > and ( a.33[ > hopefully select a family of functions 
that reduce the amplitude of the irrelevant fast modes. A second simplification is 
afford ed b y t he sh eer length of the simulations in practical cases. From a look at 
Tables 8.1 and e.2 we see that the runs can be as long as 10^^ MCS. In this situation, 
averaging the f[i{t)) over a large number of consecutive measurements implies no 
real information loss (as long as the bins are much smaller than Texp)- This binning 
procedure also has the welcome side effect of suppressing the fastest modes. 

With these considerations in mind, in practice it turns out that the Pf{t) can be 
adequately described by considering just two exponential modes. As an example of 
this, we have plotted in Figure a.2 several autocorrelation functions from our spin- 
glass simulations. The depicted correlation functions have been chosen randomly, 
with the only condition that they represent samples with very different exponential 
autocorrelation times. 



A.3.3 Thermalisation protocol for parallel tempering simulations 

We can summarise the previous discussion in the following three-step thermalisa- 
tion procedure 

1. Simulate for a minimum length of A/""^^"^ MCS, chosen to be enough to ther- 
malise most samples. Notice that most published parallel tempering studies 
stop here, assessing thermalisation only through the evolution of disorder- 
averaged observables. 

2. Discard the first sixth of the measurements and compute the Pf{t) from 
the remainder. Compute the integrated autocorrelation time with the self- 



consistent window method of Eq. (a. 22 1, using W = 6. Using this first es- 
timate of the integrated time, we extend the simulation until Af > A^"*Tint/ 
always discarding the first sixth. This criterion was introduced in jFER09B] . 
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Figure a.2: Autocorrelation functions for some of our equilibrium spin-glass simulations 
of Appendix |EJ showing samples with Texp across several orders of magnitude, plotted 
against the simulation time in units of 10^ MCS. We have plotted the range [0, 6Texp] and 
also show the result of the automated fitting procedure described in Section lA^^ In order 



to avoid cluttering the graphs we have only plotted a few times (the actual correlation 
functions have many more points). In the last panel the correlation function has a strong 
downwards fluctuation. Our fitting code takes this into account by choosing a restricted 
range and fitting to a single exponential. 
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3. Now that the simulation is reasonably dimensioned, estimate Texp (typically 
of the same order of magnitude as Tint, but always bigger). Extend the simu- 



lations until N > A^^Pzexp- 



The actual thresholds and A'^"? are somewhat arbitrary and should be tuned to 
the system at hand (especially A^"*, since the integrated time, unlike the exponential 
one, depends on the choice of / and the frequency of its measurements). 

We introduced the third step in this protocol for our spin-glass simulations 



in HjanioaI , choosing A^"* = 22, A*^^? = 12 (see A.3.3.2 for our criteria for the 
DAFF). Notice that this implied performing non-linear fits with custom ranges for 
some IC* autocorrelation functions. The first step was parameterising Pf{t) by a 
double exponential decay (an assumption empirically justified by the shape of these 



correlation functions, cf. Figure A.2I. 



Pf{t) ~ Aie-'/^i + A2e-^/^2, n = Texp > T2. (A.35) 

The choice of fitting range of course depends on the values of the t,, which vary 
across several orders of magnitude from one sample to another. However, notice 
that, if A2 = 0, then Texp = Tint- Therefore, Tint is a good starting assumption for 
Texp/ so we took the following steps: 

(i) Perform a first fit to a single exponential in the range [2Tint/3Tint], yielding an 
amplitude A and a time t. 

(11) Using Ti = T, T2 = t/10, Ai = a and A2 = 1 — A as a starting point, perform 



a non-linear fit to (A.35), with the fitting range [Tint/ 10, lOTint]. 



In most cases we can stop here. Sometimes, however, the second fit fails — typically 
because one of the amplitudes is very small, so Pf{t) is indistinguishable from a 
single exponential. This can be detected, and solved, in a number of ways: 

(iii.a) One of the A; is A; ^ 1, t\ has an absurdly large value (indicating a fit to a 
short plateau in a noisy range) or the iterative fitting method breaks down. 
In these cases, a single exponential fits the data better. We perform a fit to a 
single exponential in the range [5Tint, lOTint] • 

(iii.b) One of the A, is negative, indicating an unphysical downwards fluctuation in 



the Pf{t) for large times (cf. bottom-left panel of Figure A.2I. We perform a 
fit to a single exponential in the reduced range [2.5Tint, 3.5Tint] • 



A.3.3.1 Fail-safe mechanisms 

This automatic and fully quantitative procedure works for most samples, but there 
are some potential pitfalls that can lead to our underestimating Tgxp. For instance, 
the exponential time may sometimes be much larger than Tint- In these situations, 
Ai is very small and it is very difficult to fit the data properly. Therefore, if our 
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Figure A.3: Time spent above the critical temperature against total simulation time for two 
spin-glass samples. In each case we plot the A/V copies of the system. The left panel shows 
an 'easy' sample, where the random walk is relatively fluid and all configurations progress 
gradually. The right panel shows a 'hard' sample, where the configurations get trapped 
in local minima at low temperatures, producing long plateaux (notice the scales in both 
plots). In a very short run time (inset), only one of the copies of the second system would 
betray its hard character. 



fitting method detects Tgxp > lOzint, we increase the measurement bins by a factor 
of 10, making both Tint arid Ai grow and easing the fitting procedure. 

More generally, we have assumed that our Pf{t) has been measured in equilib- 
rium, or close to it (i.e., that our simulation is at least a couple of Tgxp long). In a 
very short run, however, some of the configurations may not even have had time 
to visit the relevant minima, and get trapped in them, producing a misleadingly 
fluid random walk. In particular, this happens when some of the copies have not 
crossed the critical temperature in the parallel tempering dynamics. To prevent 
this from happening, we also measure ihot/ defined as the time that each copy has 
spent above the critical temperature. In case one of the copies has a ihot smaller 
than a third of the median one, we consider that the simulation time is woefully 
underestimated and we simply double it. Notice that, unlike our other criteria, this 
is not completely quantitative. As a general rule, if many samples only raise the 
alarm because of the fhot check, we must conclude that our J\f™^ is too short and 
extend all the runs to a larger minimum run time. We show two examples of ^hot 



as a function of simulation time in Figure A.3 
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A.3.3.2 The criteria for the DAFF simulations 

The procedure described above for the computation of the Texp must be tuned for 
the physical system we want to study. For example, consider the case of the DAFF 
simulations discussed in Chapter [8j In contrast with our spin-glass simulations, 
which probed deep into the low-temperature region, these were restricted to the 
neighbourhood of the critical point. As a consequence, the resulting Pf{t) were gen- 
erally simpler than those of the spin glass. Indeed, most of the time we could only 
see one exponential mode, so T^nt — Tgxp- On the other hand, we only simulated 
one replica, so we could not easily estimate the error in Pf{t). This complicates the 
evaluation of Texp (making it difficult to distinguish fluctuations from real effects), 
especially for the shorter runs, which have smaller statistics. 

To take this into account, we modified our criteria to give more weight to the 
integrated time. We required a minimum run length of lOOZint (i.e., we chose = 
100 in step 11 above). In this way, even if Tgxp — lOxint, an upper bound in our 
experience, we would still have a run length of about lOxexp- 
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Analysing strongly correlated data 



In a Monte Carlo computation, one has to deal both with systematic and a statistical 
error sources. Furthermore, the strong correlation between the data makes the 
estimation of this latter error delicate. In this Appendix we present some recipes to 
handle correlated data. 



B.i 

Computing thermal averages from correlated data 

We start by resuming an issue introduced in Appendix |A} how to estimate the error 
in the average of correlated measurements. The methods contained in this section 
are well known, to the point of being included in textbooks jAMi03] . It is, however, 
convenient to present a summarised account here, if only to fix the notation for the 
more sophisticated methods of succeeding sections. 

Let us suppose we have J\f independent measurements O, of some random 
variable (observable) O. We want to use them to estimate the thermal average (O), 
for which our estimator [O] is naturally 

■'^ i=i 

This estimator is unbiased, meaning that ([O]) = (O). More than that, by virtue of 
the central limit theorem we can expect it to behave as 



[0] = {0)+r]-^ = {0)+r]Ao, cr = y'Var(O), (b.2) 
where t] is a standard Gaussian random variable. We can likewise estimate the 
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error Z\o as 



[A 



oJ 



Af-1 



N 



(B.3) 



Again, [z^q] is an unbiased estimator for A^q. It can be further shown that 



[A 



OJ 



A 



o 



\/2Ar 



(B.4) 



So, in order to get a reliable error estimate, we need to average at least 50 inde- 
pendent measurements. Now consider that our O/ are the values 0{t) that the 
observable O takes along a Monte Carlo simulation. The former formulas are 
no longer valid, because the 0(f) are no longer independent. In particular, it is 
straightforward to show that 



(([O] - {0)f) = ^((02) - (Of) + o{M-^). 



(B.5) 



Therefore, the effective number of independent measurements when estimating 
errors is not M, but A/'/2Tint,o- The computation of the Tint,o is costly and usually 
not very precise, so we need a more robust method to estimate errors. One obvious 
possibility is to consider M independent runs O(0(f). Then we can compute the 
individual [O^'^] and do 



M 



[Ao] 



M-1 



(B.6) 

(B.7) 



This strategy guarantees us independent data, but it is very cumbersome, since we 
need many runs in order to get an acceptable error estimate. It contains, however, 
the germ of a workable system: data binning. Indeed, let us combine our M 
measurements of O along a simulation into B blocks of size N / B = n: 



-1 ni 

Oi = - L o{t). 

t={i-l)n+l 



(B.8) 



Notice that 



lOi]-lLOi-^L t o{t) = [o]. 



nB 



(B.9) 



i t={i-l)n+l 



B.2 — Non-linear functions and the jackknife method 



Now let us assume that the block length n is n 3> Tint,0/ so the O, can be considered 
independent from one anotherj^ We have, then 



8-1 



B 



(b.io) 



Of course, if we do not know Tint,o we cannot know whether n ^ Tint,o- However, 
we can just evaluate [Aq\ for increasing block lengths n. For small n the error will 
be underestimated, but we will eventually reach a stationary regime. Once this is 
achieved, we should not let n grow much more, in order to maximise the number 
of blocks B and, with it, the precision of our error estimate. 



B.2 

Non-linear functions and the jackknife method 

Often, the interesting physical quantities are not restricted to thermal averages 
of observables, but include functions of these thermal averages (susceptibilities, 
correlation lengths, etc.). In general, we estimate /((O)) by /([O]). However, now 
/([O]) is a biased estimator for /((O)) (unless / is linear). Fortunately, this bias is 
0{M~'^), much smaller th an the statistical error which, as we have seen, goes as 
0(A/'~^^^) (yet, see Section B.2.i[). However, we still have the problem of providing 



an error estimate for our /([OJ). Naively, we would do this with a linear error 
propagation but here the correlations actually play in our favour, reducing the 
errors. 

Throughout this thesis we have always employed the jackknife method. This is 
simple and robust but not, we must say, the only possible choice (the other main 



alternative is the bootstrap, cf. Section B.3.3.1). We consider a function / of one or 



several thermal averages and we define the following procedure 

1. Estimate the central value /((A) , (B) , . . . , (Z)) by /([A], [B], . . . , [Z]). 



2. Bin each of the observables, as in (b.8 I, with large enough n and B. Then form 
the jackknife blocks, 

3. Evaluate the function for each jackknife block, 

/jK,! = / (^JK,f/ BjK,i/ • • • / ■ (b. 12) 



^There are boundary effects, but they are only 0{n 
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4. Compute the jackknife error estimate 



[4] = {B-1] 




(B.13) 



This error estimate coincides exactly with (b.io) for linear functions. For non-linear 
ones, it takes the correlations into account and provides a reliable estimate, since 
the blocks are allowed to fluctuate jointly. 

B.2.I Disordered systems 

In this thesis, we are mainly concerned with disordered systems, where one has 
to compute double averages of the type (O). In these cases, the application of 
the data binning procedure is even easier. Since each thermal average (O) comes 
from a different, independent sample we do not have to concern ourselves with 
the block length. We simply consider one block for each sample, so is replaced 
by our estimate [O/] for the thermal average of the z'-th sample. We can then form 
jackknife blocks with these [0/] as usual. 

However, not all is simple. Sometimes we wish to compute disorder averages 
of non-linear functions of thermal averages, that is, quantities of the type /((O)). 
In these cases, the bias produced by mocking / ( (O) ) by /( [O] ) can have dangerous 
effects. There are general cures for this problem (see, for instance, jBALg/i l). A 
particularly elegant solution is afforded by the simulation of several real replicas 
for each sample (for systems such as spin glasses, this is something we have to do 
anyway). 

For instance, let us consider a simple quadratic function, (O) . We already saw 
in (B.5I that [O]^ is a biased estimator of (O)^, with a bias of 0{Af~^). Instead, if 



we run two independent simulations, resulting in time series O* and we have 

(O'^Of^) = (O^) (O^) = {of . (B.14) 

However, the numerical estimate [0*0^] can be computed without considering 
the square of an averaged quantity (just as if 0*0^ where a single observable). 
Therefore, 

([0«0^]) = (O'^OO = {of (B.15) 



B.3 

Computing fits of correlated data 

Let us suppose we have a series of J\f points {xi,yi), which we want to fit to a 
functional form y = y{x;ai,. . .,an), depending on some unknown parameters flf. 
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For each point we have an estimate Ai of the statistical error (standard deviation). 
Then, the maximum likelihood estimator for the can be obtained by minimising 
the 'chi-square' 

If our model function is linear in the and each of the y/ are Gaussianly dis- 
tributed, it can be shown that the values of the minimum follow the chi-square 
distribution for J\f — n degrees of freedom. This distribution has mean J\f — n and 
standard deviation ^J2{N — n). Because of the central limit theorem it tends to a 
Gaussian distribution in the limit of large M — n. Notice that this also guarantees 
that it will not be too wrong to treat a non-linear function of the a, in the same way, 
as is usually done. 

The above results mean that not only can we use the to obtain the best fit 
parameters for a given functional form, but also to test whether this function is a 
good model for the data. Indeed, for a 'good' fit, the value of P^r degree of 
freedom should be ;^^/d.o.f. 1. Too large a value of ;^'^/d.o.f. would mean that 
the function is not a good fit for the data (too low a value usually means that the 
Ai are overestimated, yet see below). To be more precise, the probability that 
should exceed a particular value }i by chance in a fit with v = J\f — n degrees of 
freedom is given by the incomplete gamma function. 

Notice that giving the value of x^/ d.o.f. is not enough, we need to know x^ arid v 
separately. For example, a fit with 2 degrees of freedom and x^ — 4: would have 
X^/d.o.i. — 2 and V ~ 0.135 (not a very good fit, but still acceptable). However, a 
fit with 10 degrees of freedom and x^ = 20 would also have /d.o.f. = 2, yet now 
the P-value would heV 0.029, indicating a bad fit. 

We can also use the x^ test to estimate the errors in the fit parameters. Let 
us denote by x^nm by af^^ the best fit parameters and the resulting x^ value. 
Then, finding the error in is a matter of perturbing this parameter and noting the 
change in x^- The traditional recipe, followed here, is to modify af^^^ to af^^^ + Aai 
and perform a new fit where the other parameters are varied, but aj is kept fixed. 
This new fit will have a X^ = /Cmin + ^X^- Since we are considering the fluctuations 
in x^ caused by a single (assumed to be) Gaussian variable Uj, a variation of one 
standard deviation in a/ is equivalent to Ax^ — 1. 

We have hitherto considered the simiplifying assumption that the i/; are un- 
correlated. This is sometimes the case, a typical example being fits where the x 
coordinate is the lattice size, so each comes from a completely independent sim- 
ulation. Very often, however, the data are strongly correlated. The rigorous way to 
take this into account is to consider the full covariance matrix of the results. We 
define 

(Tij = Co\{yi,yj), Si = yi - f{xi;ai, ...,an). (b.i8) 
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Let us mention in passing that the covariance Cov( A, B) can be estimated from our 
jackknife blocks as follows: 



[Cov{AB)] = {B-l) 



E(AjK,-[A])(BjK,-[B]) 



Then, the complete estimator is 



(B.19) 



(b.20) 



which, of course, if the data are not correlated is the same as (b.i6), since an = A 



The errors in the parameters are then estimated just as in the uncorrelated case. 

The actual minimisation of can be performed in a number of ways. Let us 
first note that, if / is linear in the fl,, then ( |b.2o[ > is just a quadratic form in the fl/, 
whose minimisation yields a set of linear equations. If / is non-linear, however, 
we must usually employ an iterative method. In this thesis we have used the 
Levenberg-Marquardt algorithm jpRE92] . 

B.3.1 Singularities in the covariance matrix 

Sometimes the data correlation is so extreme that the covariance matrix turns out 
to be singular. This is the case, for instance, where we include My random variables 
y, in a fit where we only have AAgampies < -M/ samples (or jackknife blocks). In order 
to prove this, let us denote each random variable with a Latin index i = 1, . . ■ ,My 
and each sample with a Greek index jd = 1, . . . , A4ampies- now define the matrix 



T- 

1 ji 



1 



samples 



M. 



(B.21) 



' samples 

Now, since A/'gampies < -M// it is clear that T/^ 
other hand, the covariance matrix is just 

1 



E y'i 

has at most rank A/'gampies- Ori the 



samples ( -^samples 



1) 



TT 



Therefore, 



rk{a) = rk(rT+) = rk(r) < K^mpies < My 



(B.22) 



(B.23) 

and the matrix a is singular. The most obvious example of this are the fits of tem- 
poral correlations in Part IV where we must consider some 10^ random variables 
extracted from a set of a few hundred samples. 

Throughout this thesis, we have followed an empirical procedure to address this 
issue. We find the best fit parameters using only the diagonal part of the covariance 
matrix, i.e., minimising the so-called diagonal chi-square 



(B.24) 
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In order to take the correlations into account, we perform an independent fit for 
each jackknife block and use their fluctuations to compute the errors in the a,. We 
have found this method to be reliable (see Section B.3.3.1 below), even if it compli- 
cates the interpretation of the x\ values. Indeed, since the fitting function fluctuates 
coherently with the data, the values of ;^^/d.o.f. are typically much smaller than 
1. In effect, behaves as if there were fewer degrees of freedom, so we cannot 
readily compute the P values. 

Thro ughout this thesis, we shall use to denote either the full estimator 
of (B.20) or the of ^ fit of uncorrelated data and we shall denote by x\ 
diagonal chi-square of (b. 24 1. 



B.3. 2 Errors in the x and y coordinates 

Thus far we have assumed that there are only errors in the y coordinate of the data, 
which is often the case. If this is not true, we would have to estimate the chi-square 
as (assuming no data correlation, for simplicity) 



2 _ yC {yi- f{xi;ai,...,an)f 
Var(i// - f{xi-ai,...,an)) 



(B.25) 



Notice that now the which we must change during our iterative procedure to 
perform the fit, appear also in the denominator. For non-linear / the corresponding 
expressions rapidly get intractable with our usual numerical methods. 

Sometimes one can skirt the issue by performing separate fits for the and y; 
and relating the resulting fit parameters with those of the real fit io y = f{x) (see. 



for instance. Section 10.4 1. If this is not possible, one can often consider a simplified 
procedure for minimising (B.25 1. Suppose that the errors in (X;, are similar for 
both coordinates and for all values of i. Then the denominator in ( b.25[ > is almost 
constant in i and the values of the a, that minimise 



(B.26) 



i=l 



should be very similar to those that minimise the real x^ estimator. Now, this 
would give us an approximation to the values of the best-fit parameters, but not an 
idea of the fit's goodness. But the latter can be estimated by an a posteriori x^ test 



with (B.25 */ where we no longer vary any parameter. 



B.3.3 Case studies 

In the remainder of this section we present two examples of the previous methods, 
taken from our work. 
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B.3.3.1 Estimating the error in the replicon exponent 

In this section we perform a detailed test of our technique for coping with singular 
covariance matrices. We have chosen the computation of the replicon exponent a, 



detailed in Section 11.3.1 There are two reasons for this: first, it is a particularly 
tricky case, with both statistical and systematic error sources for the i/;. Second, we 
actually have two independent determinations, one with ten times more data, so 
we can check the reliability of the method. 

Let us recall the steps of the computation. We want to estimate the exponent a 
in an equation 

h{t^)=Af-''{t^). (B.27) 

The quantities Ii and ^ are estimated from integrals of the spatial correlation func- 
tion for each time, using a dynamical cutoff procedure: 

[hM] = rdrr'^[C4(r,iw)], (B.28) 
Jo 

where the cutoff A is chosen as the point where the relative error in [C4(r, f^)] 
becomes larger than 50%. The coherence length is estimated as 

m^)] = [hM]/[h{u)]. (B.29) 

We then perform two independent fits, using the diagonal chi-square method, 

7i(fw) = BC ^(^w) = Ctl^\ (B.30) 

and we finally estimate a as 

a = 2 — cz, (B.31) 

a relation that is used, of course, for each jackknife block in order to estimate errors. 

We first performed this procedure in | |jano8cU , for simulations at T = 0.7 with 
63 samples, obtaining the result 

a = 0.355(15). (B.32) 

The computation was then repeated in [ jjANogA] , this time with 768 samples, pro- 
ducing 

a = 0.397(12). (B.33) 

The discrepancy in the results could be due to a simple statistical fluctuation, but 
they are different enough to make this unlikely. Another hint that something may 
be amiss is that the error has not decreased nearly the factor ^768/ 63 that one 
would expect from the increase in statistics. 

Therefore, we may have underestimated our errors, or there may be some bias 
in our methods (or in our statistics). The first source of systematic error is the 
dynamic cutoff method. Naturally, for the simulations with 768 samples the cutoff 
point A is larger. This has the effect of trading statistical uncertainty (including 
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Figure b.i: Left: Probability density function of a^^^\ the estimate of exponent a as obtained 
from a set of 63 samples. The dots with horizontal error bars are, from top to bottom: our 
best estimate with 768 samples, the value with the 63 samples of |jano8c| and the mean 
and standard deviation of a^^'^\ The continuous line is a Gaussian distribution with the 
same mean and variance. Centre: As in left panel, for the jackknife errors A a ■ The vertical 
line marks the standard deviation of a^^^\ Right: Histogram of the ;^;^/d.o.f. for the ^ fit, 
which had 80 degrees of freedom. We see that there is a very large variance, indicating 
that the effective number of degrees of freedom is much smaller. As a comparison, we have 
plotted with a continuous line the chi-square distribution (per d.o.f.) for only 8 degrees of 
freedom. 



more noisy data at the tails) for a reduction of systematic errors, and is probably at 
the core of the small decrease in our error estimate. 

However, the effect is large enough to merit a more detailed study, which we 
shall carry out with a bootstrap procedure (see, e.g., |jEFR94|). Our objective is to 
estimate the probability distribution of a^^'^^ defined as the result of computing 
a from a statistic of 63 samples. This can be easily done, since from our second 
set of simulations we have 768 samples. Therefore, we can pick random sets of 63 

samples and determine for each set the estimate of fl^^'^^ and of its error A^^'^K There 
are {^^^) ^ 10^^ possible combinations, so our distribution of a^^^^ computed from 
a set of 768 samples can be considered the 'real' one. 

We have used 10 000 random sets, with which we h ave computed normalised 



histograms of fl'^^'^^ and its errors, plotted in Figure b.i From the first histogram 
we can take several conclusions: 

1. The distribution of Gaussian, with a very good approximation. We 
denote its mean 

by E(fl(63)). 

2. We were unfortunate in | |jano8c[ , in that the 63 samples used therein are 
'atypical'. More precisely, the samples of | |jano8c| produce an estimate of 



that is 2.2 standard deviations away from its true expectation value 
E(fl(^^)) (there is only a 3% probability to obtain a larger fluctuation in Gaus- 
sianly distributed results). 
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Figure b.2: Exponent a(tw) for our T = 0.6 simulations, with a fit to a quadratic polynomial 
in log t„. We show on the left panel the same data as in Figure 
the result of binning the data in blocks of 5 consecutive times. 



10.2 



and on the right one 



3. The random variable a^^'^^ seems to be a slightly biased estimator of a, E(fl(^'^)) 
showing a small deviation from our estimate of a with 768 samples. This is 
probably due from the cutoff effects discussed earlier, although the fact that a 
is obtained from a non-linear operation may also produce a small bias in the 
fitting procedure. 

From the second histogram, representing the pdf of A}, , we see that the jackknife 
method tends to underestimate the errors somewhat (compare the mean of this 
distribution with the vertical line, marking the true standard deviation of fl'^^'^)). 

As a final result, we can look at the histogram of the ;^^/d.o.f. obtained in 
the 10000 fits during the bootstrapping procedure. Each of these had 80 degrees of 
freedom, yet the standard deviation of the distribution is much larger than that of a 
chi-square distribution with 80 degrees of freedom. This supports our initial claim 
that fits with the diagonal chi-square procedure have a small effective number of 
degrees of freedom. 



6,3,3.2 The full-aging exponent 

Let us now consider our analysis of full aging in Section 10.2 The oi{tw) that we 
plotted in Figure 10.2 on page 172 showed suspicious fluctuations (this is especially 
noticeable for T = 0.6). This may lead the reader to think we have underestimated 
our errors when computing fits to { io.ig\ . 

We can check that the errors are actually correctly estimated by performing a fit 
of a: (^w) to a very smooth function. We have done this in the worst case (T = 0.6, 
Figure |b.2| — left) with a quadratic fit 



[tw) = ao + ai log iw + 0i2 log^ ^ 



(B.34) 
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Figure B.3: Left: Estimate of ft:(tw) in (10.191 for our T = 0.7 simulatior\s with 63 ar\d 768 
samples. Right: Values of x^/ d.o.f. for both series. 



For iw > 10^ we have 
ao = 6.4(6), 



cci = 0.19(8), 



^2 = -0.0035(20), 



(b-35) 



with a diagonal chi-square of ;\;^/ d.o.f. = 66.26/ 63. 

As an additional check we show in Figure |b.2| — right the same plot, but now 
binning the data in blocks of 5 consecutive times. The resulting parameters are 



X5) 



6.2(6), af^= 0.20(8), = -0.0039(24), (B.36) 

with /d.o.f. = 11.58/10. Therefore, neither the parameters nor the value of 
;^^/ d.o.f. are affected by the binning procedure. Notice that in Figure [b!2] — right 
the wiggles in the curve seem just what they are: random fluctuations. 

A second check is provided by our simulations for T = 0.7. For this tem- 
perature, we performed two series of runs (cf. Appendix |E|). The first, reaching 

= 10^^, included 63 samples. The second, for 768 samples, reached t^ = 10^^. 
We have plotted oc{tyf) for both sets in Figure B.3 We see that, increasing the num- 
ber of samples, the period of the oscillations does not change, but their amplitude 
decreases proportionally to the error. However, see right panel, the x^/ d.o.f., which 
is reasonable from ^ 10* for 63 samples, is not good with 768 until ^ 10^. 



(5) 



This indicates that there probably is a systematic deviation from (10.19). Neverthe- 
less, the estimates for a{tw) are compatible in a much wider range. 



B.4 

Using correlations to improve the statistics: control vari- 
ates 



In the above section, we have discussed the difficulties posed by data correlation. 
However, sometimes correlations can be exploited to our advantage. A good ex- 
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ample is the use of control variates (see jRUBoy] and jFERogAi l for a review, applied 
to Monte Carlo simulations). 

The idea is the following, assume we want to compute some parameter A and 
that we have a random variable A such that E(^) = A (that is, A is an unbiased 
estimator for A). Here E denotes the mathematical expectation, which in a Monte 
Carlo context can be either a thermal or a disorder average. In addition, we have 
an estimator B (the control variate) whose expectation value B = E{B) is known in 
advance. Then A' = A + dc{B — B) is another unbiased estimator for A, E(^') = 
E(^) = A. Furthermore, 

Var(^') = Var(^) + (x^Veir{B) + 2aCov( A B), (B.37) 

where Var denotes the variance and Cov{A,B) the covariance of A and B. It is 
easy to show that choosing 

Cov(AB) 



Yar{B) 



(B.38) 



minimises the variance of A', so that 



VariA') = (1 - n%^)yar{A), = .^""^[t^lr^V (^-39) 

^^Var{A)Var{B) 

where TZj,,B is the correlation coefficient of A and B. Notice that this process 
not only reduces statistical errors in the sought parameter, but also corrects finite- 
statistics biases. 

We have applied this method in order to minimise the variance in the disorder 
average of physical quantities in our DATE simulations of Chapter |8j Eor potential 
control variates we can consider the distribution of the vacancies iri the lattice. 
Indeed, we know that, for instance, the average value of spins in the lattice is 



AAspins = PN. (B.40) 

More generally, the distribution of empty nodes on the lattice should be completely 
random, with no periodicity or correlation. We can, therefore, consider the Eourier 
transform of the Cx, 

T{k)=J^exe-''-K (B.41) 

X 

We have, then. 



•^(0) = Ampins = pN, T{k) =0, ky^O. (B.42) 

Once we have found viable candidates for control variates, we have to check 
whether they are correlated with any physical quantity. Let us start with Ngpins, for 



which we show scatter plots in the first three panels of Eigure B.4 This quantity has 
a strong correlation with the energy of the sample (top left) and a slight correlation 
with the regular component of the tethered field (top right), but no correlation at 
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Figure B.4: Scatter plots of the energy and tethered field of individual samples against 
the number of spins (top panels and bottom left panel). The bottom right panel plots 
the staggered component of the tethered magnetic field against the T{k) of Eq. ( B.41I 1 for 
k = {n,n,n). All panels show our 1000 samples for L = 24 at /5 = 0.625, fh = 0.12 and 
nic = 0.5. 



all with bs (bottom left). However, see bottom right panel, this latter quantity does 
show a correlation with = T[k = (tt, n,n)). 

Therefore, we have used A/spins as a control variate for the energy and h and 
J-n as a control variate for h^. The correlation coefficients for an L = 24 system at 
^ = 0.625, m = 0.12 and = 0.5 arfl 

^«X3p,, = -0.85, n^^^^^^^ = 0.47, n^^^^^ = -0.62. (B.43) 



For the energy, the resulting reduction, Eq. (B.39I, is of about 50%. Even for b, 



the error reduction is of about 20%. Recalling that the statistical error otherwise 



goes as ^ 1 / y A/'sampies/ we see that using control variates for bs is equivalent to an 
increase of ~ 65% in the number of samples (and, hence, in the simulation time). 



^The numbers are similar for other lattice sizes and magnetisations. 
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Fine tuning Tethered Monte Carlo 



This chapter contains some technical details on our tethered simulations, as well 
as some practical recipes for the correct usage of TMC. We use the example of the 
D = 2 Ising model, as in Chapter |5| except for Section C.4 which explains our 
numerical implementation for the DAFF. 



Ci 

Numerical implementation of the Metropolis scheme 
for the D — 2 Ising model 



We have used the Metropolis algorithm explained in Section 3.2.1 A naive im- 
plementation is completely straightforward and no different than that of canonical 
Metropolis (see jAMi03] , for instance, for a complete working example). We can, 
however, significantly enhance the performance in a simple way. Indeed, using the 
explicit form (4^17]) of the tethered weight, the spin reversal probability is 



0, M' > M, 

min{l,e^^}, M' < M, 



where 



AS = -m' + {W - M) - - 1) log (1 - . (c.2) 

Now, in a Metropolis simulation, most of the CPU time is going to be dedicated to 
evaluating exp[AS], which includes computing a logarithm, and therefore needs a 
large number of elementary operations. However, there are only a finite number of 
possible values for AS. 

• {U' — U) = —Hsx, where H is the local field around x and can only take 5 
different values for this model, H = —4, —2,0,2,4. 
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• M' = M - 2sx and M can only take (N + 1) values. 

• The spin itself can take only two values. 

In all, we have 10(N + 1) possible values of the Metropolis update probability 
(many of which are going to be zero or one, because of the constraints). This is 
a manageable number, so we can compute them all and store them in memory in 
a look-up table before starting the simulation. This way, we remove the need for 
computing logarithms on the fly and greatly accelerate the code. 

A second note concerns the choice of the pseudorandom number generator. 
We originally used a 64-bit congruential generator reported in |osso4Bj l. We ran 
through the lattice sequentially, extracting one random number per site (although, 
in principle, we would only need one when AS < and M > M'). This reproduces 
the conditions studied in josso4Bf [osso4a|, where significant deviations from the 
expected values where found using the same generator in a D = 3 Ising model for 
N > 128^. Perhaps unsurprisingly, we also obtained wrong results for our larger 
lattices (N = 1024^). We solved the problem by repeating all our simulations 
with a more sophisticated generator, computed as the sum (modulo 2^4) of the 
congruential generator and a 64-bit version of the shift-register method reported 
in ]par83[ . 



C.2 

Sampling the m space 

The first practical decision in a TMC computation is the sampling of the coordinate 
space, that is, how many values of m to simulate and where to put them. This is 
an important matter because the reconstruction of the canonical partition function 
(and of the canonical expectation values) involves an integral over exp[— NHn], 
where is itself an integral of the discretised (fc),^^. A good sampling should 
make the discretisation error s ne gligible, as well as minimise the statistical ones. 

Recalling the definition of M, we see that this parameter can, in prin- 
ciple take values in the interval [— 1, oo). In practice, though, m ~ m + 1/2, so 
the pdf p(m) is going to be completely negligible outside a finite range (roughly 
[—1/2,3/2], except for very small lattices, where we need a broader scope). In 



addition, see Figure 5.1 p[m) is a two-peaked distribution, so values of m close to 
the probability maxima contribute more towards the canonical average. The peaks 
get narrower and closer together as L grows (notice that Oj^i has to be a convex 
function in the thermodynamical limit). It may seem that the choice of rh is quite 
delicate (especially considering we do not know the pdf until we have run the sim- 
ulation). Finally, given a fixed computational budget, is it better to compute (b)^^ 
at more points or more precisely at each point with a coarser grid? 

Let us assume we have measured (b)^ at K points in a grid. Our numerical 
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estimator [b]rn at point k is 

mk={b)k + Via k = l,...,K. (c.3) 

Here rj]^ are the statistical errors, expected to follow a Gaussian distribution and to 
be of similar size and statistically uncorrelated. Therefore, our numerical estimate 
for Djv will be given by some quadrature formula 

N ^ K 

On ^ [On] = Y^gk{b)k+Y^ Sm- (c.4) 

k=\ k=l 

The first term in this equation is subject to systematical error while the second one 
represents the statistical fluctuations. Naively, the quadrature coefficients scale 
as 1 / N, so the error will scale as 1 / y/N. This suggests that doubling the number 
of points is equivalent to doubling the simulation time in each one. The analysis of 
canonical expectation values is, however, more involved, since the errors in Hjv are 
going to be highly correlated for different rh. Therefore, we perform a numerical 
experiment. 



We report in Table c.i the values for the energy density (u) at the critical tem- 
perature for an L = 128 lattice, obtained in two different series of runs. In the 
first column, we take 10^ Monte Carlo sweeps (MCS) on each point, while in the 
second we perform 10^ such updates. In both cases the points are evenly spaced. 
In the third and fourth columns we report analogous simulations, but now with a 
greater density of points inside the peaks. We can reach several conclusions from 
this table: 

• If we use to low a number of points we will see a significant systematic error, 
no matter how precise they are. 

• Once the systematic error is under control, increasing the number of evenly 
distributed points has an effect of 1 / A/A/points in the statistical error. This is 
best seen in the leftmost column. The effect is roughly the same if we increase 
the number of MCS in each point by the same factor (the errors of the first 
column are ^ y/W times greater than the corresponding ones of the second). 

• If we add more points inside the peaks, the error may decrease faster than 
1 / a/ A/points- The errors with A/'points and uniform sampling are only slightly 
smaller than those with A/'points/ 2 and improved sampling. 

We can summarise this analysis with the following prescription for the choice of fh: 

1. Run a first simulation with enough uniformly sampled rn so that the system- 
atic error is small or unnoticeable (i.e., so that the peaks of the distribution 
can be roughly reconstructed and the tails are reliably sampled). We have 
used ^ 50. This may seem a big number, but we must take into account that 
we have only looked at the energy in Table |c.i[ Other quantities, such as high 
moments of the magnetisation or observables at a nonzero magnetic field, 
require that the tails of the distribution be reasonably well sampled. 
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. r Uniform sampling Improved sampling 

points 

lO^MCS lO^MCS lO'^MCS 10'' MCS 



12 


1.43728(33) 


1.43738(11) 






23 


1.419 25(22) 


1.419117(6) 






46 


1.419 21(13) 


1.419 117(43) 


1.419 08(11) 


1.419 107(38) 


91 


1.419 14(10) 


1.419 093(36) 


1.419 13(8) 


1.419 128(31) 


181 


1.419 14(7) 


1.419 095(28) 


1.419 03(5) 




451 


1.419 06(5) 




1.419 073(39) 




901 


1.419 065(33) 




1.419 077(26) 




1801 


1.419 062(24) 








Exact 




1.419 076272086... 





Table c.i: Final value for — (w) as we change the number of points for the reconstruction 
of On and their precision. MCS = Monte Carlo Sweeps for each point. The runs labelled 
'uniform sampling' consist of A/"points values of m evenly distributed in the range [—0.4, 1.4]. 
The runs labelled 'improved sampling' have 2/3A/'pomts points evenly distributed in the 
same range, plus and additional A/'points / 3 inside the peaks, effectively doubling the density 
in the dominant regions. The last results of each column have a similar precision, but those 
computed with uniform sampling required twice the simulation time. The exact value has 
been computed from the results of [ |fer69| . 

2. Add a similar number of points inside the peaks of the pdf to eliminate the 
systematic error and reduce the statistical error. 

We have found that the second step is not always necessary. In fact, for lattices up 
to L = 256 we have limited ourselves to computing 51 evenly distributed m. For 
bigger lattices the peaks are steeper and we have added another 26 points inside 
them. 

This general prescription can be generalised to more complicated situations 
(such as the DAFF studied in Chapter |8]). The rule of thumb is that the final results 
should not change, within the statistical errors, even if we remove several points. 

A final comment concerns computation time. Obviously, it takes as much total 
CPU time to run 10 simulations of length T as one simulation of length lOT. The 
wall-clock, defined as the physical time we have to wait for the results, is, however, 
10 times smaller in the former case, since the different tethered simulations are 
trivially parallelisable. Therefore, in a large-scale computation and so long as ther- 
malisation has been ensured, it is generally better to add more points rather than 
extend the original ones. 
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c.3 

Integrating over m space 



The numerical implementation of (4^23]! can be done in several different ways of 



similar numerical accuracy. Here we briefly explain our choices. Once we have the 
(0)j^,^ we need to take three steps 

1. Interpolate the tethered averages as a smooth function of fh. 

2. Integrate {b)^ to get On- 



3. Use (4.4) to compute the canonical averages. 



For the first step we have used a cubic spline interpolation (see, e.g., |jFRE92| for 
an implementation). We have not used the so called natural spline, which imposes 
vanishing curvature for (b)^ at rhmax and rfiram- Instead, we have estimated the 
derivative of this function at both ending points with a parabolic interpolation. 

In order to compute the potential Qj^ we have to integrate (b)^ for our whole 
ih range. In principle we could compute 0]\i from 

lN{rh) = dm' (b)^,, (c.5) 

Ijv defines Oj^ up to an additive normalisation constant. This expression has the 
disadvantage of not treating symmetrically the m range. In particular, values of m 
close to mmin will have smaller errors than those close to rfimax- In order to avoid 
this problem, we use the average of the integral in both directions, 

= -( dm' (b),, - / dm' {b),, . (c.6) 

Notice that, assuming mmin and mmax are far enough as to have negligible proba- 



bility, both (c.5) and (c.6 1 are equivalent, considering the potential nature of Ojv. It 
is only when we consider numerical issues that the second is preferable. 
We now normalise the pdf, 

C = dm exp[—NlN{rh)], (c.7) 

SO that 

nN(m) = JN(m)-llogC. (C.8) 

Some comments are in order. First of all, naively applying this interpolation 
scheme for the exponential, p(m) = exp[—Nn]si{rh)], could introduce strong in- 
tegration errors. Fortunately, this can be easily solved by accurately representing 
r}]v, which is a smooth function (recall the curve in logarithmic scale of Figure 
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In particular, since we have represented the tethered field with a cubic spline, 
is a fourth order polynomial between each pair of simulated points, which 
can be exactly computed. To avoid losing precision, we evaluate C}^{m) in an 
extended grid that includes 3 equally spaced intermediate points between each 
pair of simulated values of m. This way the Lagrange interpolating polynomial for 
each segment of the extended lattice (two original points plus three intermediate 
ones) represents the exact integral of our spline. 

Of course, the pdf, exp[— NDjv(m)], is not a polynomial anymore. It is, how- 
ever, a smooth function, so a self consistent Romberg method jpRE92j provides an 



estimate of the integral (4.4 1 with any required numerical accuracy. Notice that this 
yields the basically exact results for a given interpolation of (&)^, but it does not 
cure any discretisation errors introduced by the spline, which should be minimised 
with the prescription of the previous section. 

Throughout this process, we estimate errors with the jackknife method (see 
Appendix |B|). For instance, we compute and integrate a different cubic spline for 
each jackknife block. This way, data correlation is safely taken into account when 
estimating the statistical errors. 

Typically, even with a very moderate effort, the Romberg integration error has 
been much smaller than the statistical one. There is one exception: the fluctuation- 
dissipation formulas, such as ( |5.2[ >, because of the large cancellations between the 
two terms. To solve this problem, we have computed the fluctuation-dissipation 
formula as a sum of two squares: 



[u^)^-{u)l+{u)l-{uf 



N-^C = {u^) - {uf = j dm p{m) 

dmp{m) {{u-{i2)^r)^ + {{u),^-{u)r . (c.9) 



In spite of this, as a consistency check, we have also employed the original equation 
and forced the Romberg integral to yield the same value, by reducing its tolerance. 

It is sometimes interesting to compute high moments of the magnetisation. One 
obvious possibility is to measure the instantaneous values for m^(m; {cx}) during 
the simulation and then compute the tethered and canonical averages as usual. But 
TMC provides an alternative way of calculating (m^). Indeed, we have the whole 
pdf p(m) and we know that M = M + R. Now, the moments for R can be easily 
obtained analytically, so it suffices to compute (m^) to reconstruct (m^) without 
any need for individual measurements of m^. For example, 

(m2) = ((m- 1/2)2)-^. (CIO) 
(m4) = ((m-l/2)4)-l(m2)-^ + ^. (cii) 

These formulas are valid for the symmetric phase, where (m) = 0. We have com- 
puted the moments of the magnetisation up to (nfi), both from individual mea- 
surements and with this procedure, and the results are identical. This will not be 
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at all surprising once we see Section 5.5 where it is shown that the correlation time 
for {m)fn is < 1 (which means that the uncertainty in p{m) is going to determine 
the total error). 



C.4 

Numerical implementation of TMC for the DAFF 

The tethered simulations of the DAFF were also carried out with a Metropolis 
algorithm. In principle we could have used the, potentially more efficient, cluster 
method studied in detail in Appendix [6| Indeed, the formulation for the DAFF 
follows the same steps as that for the Ising model , with the exceptions that the 
clusters are now of constant staggered spin and that we now have two tethered 
quantities, so now the cluster flipping weight is 

cv{{Si}) cKj{m,m)j{ms,ms), (c.12) 



where j(x,x) was defined in (4.33 1. Unfortunately, for the relevant temperature 



and magnetisation regimes for the DAFF, the spin configurations are not well suited 



to cluster methods. In particular (see Section 8.6 1, the whole configuration is dom- 
inated by two very large clusters of opposite staggered spin, of similar size for 
rhs = 0.5 and with one dominant cluster in magnetised regions. What few spins 
remain outside them are scattered in a few very small clusters and a large num- 
ber of single-spin ones. In these conditions, and given the stringent constraints on 



(m,ms) imposed by (c.12 1, the cluster flipping would only invert loose spins and 
leave the large clusters untouched. 

We must, therefore, stick with the Metropolis scheme. Still, there are a few 
things we can do to optimise the simulation. First of all, we avoid computing loga- 
rithms when evaluating the Metropolis a ccep tance probability with the preparation 



of look-up tables, as discussed in Section C.i Now we have to consider 0{N ) val- 



ues for (M, Ms), so we cannot fit all the possibilities in a single array. However, 
since the contributions of U, M and Mg to the update probability are factorised, we 
can simply prepare one look-up table for each of these quantities and evaluate the 
whole Metropolis probability with a simple multiplication. 

The peculiarities of the DAFF in the tethered formalism afford us an additional 
possible optimisation. It turns out that the Metropolis acceptance is very low, of 
about 10% in most cases (this is mainly due to the regular magnetisation, which 
is in a very constrained regime due to the high applied fields that we consider). 
In addition, with the look-up table method, the computation of the acceptance 
probability is faster than the generation of a single pseudorandom numberj^ 

With this in mind, let us suppose we are performing a sweep of the whole 
lattice, updating nodes to N — ij^ Let us call the probability of flipping spin 



^We use the same combination of congruential and shift-register generator as in Section 
^Empty nodes are simply skipped, we run through the lattice sequentially 
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i and qi = 1 — p; the probability of rejecting the change. Then, the probability of 
leaving the spins from i = kio i = s unchanged 

s 

Qk,s = n^/ = Q/c,s-i^s- (c.13) 

Therefore, we can define a 'survival' Metropolis scheme in the following way: 
• For = 0, . . . , N - 1 do 

1. Generate a single pseudorandom number 7?. G [0, 1). 

2. Compute Qfc^s until Qfc,s-i > 7^ > Q^^g. Then flip spin s. 

3. Set = s + 1. 

This algorithm obviously satisfies the balance condition for a Monte Carlo method 
and should be more efficient than the standard Metropolis for low acceptance prob- 
abilities. In practice, modern compilers make this a strongly device-dependent 
optimisation. We have found that for modern Intel processors, using the Intel C 
compiler ice, the gain is negligible for the acceptance of 10% found in our DAFF 
simulations. However, more than half of our simulations (over 3.5 million CPU 
hours) were carried out on the Mare Nostrum supercomputer, of different design, 
where the survival version of the Metropolis scheme nets us a 30 to 40% accelera- 
tion. 

Finally, we take one parallel tempering (Section A.3.i[ > step for each lattice up- 



date, attempting to exchange configurations for the same (m, ms) but different /3. In 
the tethered weight ( 4.i7[ >, the factors that depend on the magnetisation are inde- 



pendent of /3, so the parallel tempering acceptance probability is exactly the same 
as that for a canonical simulation, given by ( a.3o[ >. For our longest simulations, we 



use a parallel version of our code that runs the Metropolis update of each of the 
participating configurations in a different processor. When the whole lattice has 
been updated, the energies and temperatures of each configuration are sent to the 
master process (with MPI, Message Passing Interface), which performs the parallel 
tempering update and sends back the new temperatures. In this way the total CPU 
time is of course unchanged, but the wall-clock is reduced by a factor Mj. 



APPENDIX D 



The Janus computer 



The Monte Carlo simulation of complex disordered systems has an unquenchable 
thirst for computing power. Any acceleration due to innovations in computer de- 
sign is immediately exhausted. In addition, the terrible scaling properties of the 
characteristic times (often exponential, as we pointed out in Chapter ^ mean that, 
in order to obtain a qualitative improvement in the data, one needs an acceleration 
of orders of magnitude. 

In these conditions, the investigation on new, optimised, Monte Carlo meth- 
ods in not only necessary but also rewarding, as we saw with our examination of 
the Tethered Monte Carlo method during Part |ll] of this dissertation. Sometimes, 
however, devising ingenious dynamical algorithms is not an option. 

A prime example of this issue is the non-equilibrium dynamics of spin glasses. 
Here, we face the following constraints 

• The interesting physical phenomena occur at macroscopic times, with the 
typical experimental scale ranging from a few seconds to a few hours. 

• The physical dynamics can be faithfully simulated with heat bath or Metropo- 
lis dynamics, but each MCS is equivalent to only ^ 10~^^ s jMYD93] . 

• One has to simulate very large lattices, to keep the system in a truly off- 
equilibrium regime, free of finite-size effects. Therefore, each MCS takes some 
time. 

Notice that it is not easy to see how one could improve this situation with a clever 
update algorithm, since we want to reproduce the physical dynamics. 

Thankfully, the physics of the spin glass provide a different avenue for accel- 
eration: parallelisation. Indeed, the nearest-neighbours nature of the interaction 
means that we can divide each plane of the lattice in a checkerboard scheme (Fig- 
ure D.il, with sites of the same colour being statistically independent from one 
another. That is, one could in principle update the whole lattice in two steps: first 
all the black sites at once and then all the white sites at once. 
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Figure d.i: Division of the spin-glass lattice in a checkerboard scheme. The first neighbours 
are always the opposite colour of the site, so sites of the same colour can be updated at the 
same time. 



The problem with this idealised scheme is that conventional computers (CPUs) 
are simply not equipped to handle it: 

• CPUs are optimised for operations on long data words, but we need (i) op- 
erations on single bits (the spins) (ii) variables that only appear in a small 
number of states (the local field needed to compute the update probability). 

• The memory architecture does not permit the processor to gather all the nec- 
essary information quickly enough. 

• We need one random number for each node that we update, so random- 
number generation is another bottleneck. 

Sharing the computation across several CPUs (making each handle one portion of 
the lattice) does not work due to communication limitations: even if we only need 
small chunks of data, they have to be accessed extremely often. 

One possible answer to this problem is designing a special-purpose computer, 
whose architecture is optimised for simulations of spin systems. We note that 
this idea is not new, custom computers have been used in statistical mechanics for 
some time ]oGi83[ iCRUOiil . The Janus computer, fruit of a collaboration between 



physicists and engineers in the universities of Zaragoza, Complutense de Madrid 
and Extremadura (in Spain) and Ferrara and Rome i 'La Sapienza' (in Italy), is one 
such custom machine. 

Since the author was not involved in the hardware design or the low-level pro- 
gramming of the machine, just in the physical analysis of the simulations, this is not 
the place to give a detailed account of the machine. We shall only consider some 
brief notions to understand why Janus can outperform conventional computers. 
The interested reader is referred to ||jano61 , |jano9b| for details on the architecture 
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and to Pjano8b| for an explanation of the implementation of the Monte Carlo sim- 



ulation. See also the doctoral dissertation of R Mantovani IImanoSII . 

The solution adopted by Janus consists in replacing the conventional CPUs by 
Field Programmable Gate Arrays (FPGAs) as computing nodes. These devices offer 
a large number of reconfigurable logic resources, so they can be divided into many 
update engines, each taking care of the update of one spin. 

The FPGAs are grouped in 4 x 4 arrays, forming boards (Figure d.2[ >. Each 



board includes data links among nearest neighbours and an input/ output proces- 
sor (lOP), whose job is to distribute work among the computing nodes and gather 
the results. There is a standard PC (called host) for each two boards, which acts as 
the interface and storage element. The whole Janus computer consists of 16 boards 
(a total of 256 processors) in one rack (Figure d.3[ >. 



In the basic implementation for non-equilibrium dynamics, each FPGA simu- 
lates two replicas of one sample, divided in the checkerboard schemej^ The im- 
plementation seeks to maximise the number of spins that can be updated in each 
clock cycle. In particular, each FPGA can be divided in 1024 update engines. Each 
of them takes the following steps 

1. Reads as input 6 nearest neighbours and the corresponding 6 couplings. 

2. Computes the local field and addresses a pre-computed look-up table (cf. 
Section C.i| >. 



3. Compares the result with a freshly generated random number. 

4. Sets the new value for the spin. 

This whole process is pipelineable so that each of the 1024 update engine updates 
one spin per clock cycle (notice that this requires a clever memory architecture, so 
the needed information can be accessed with no latency). 

Another potential bottleneck is the generation of random numbers. As we 
indicated above, we need one random number for each spin of the 1024 spins that 
we update in each clock cycle. For these, we use the Parisi-Rapuano shift-register 
generator [ |PAr83 |. We have a wheel I{k) of 62 numbers (32-bit integers). Then, in 



order to generate a random number R{K) we perform the following operation 

R(k) = {I{k - 24) + I{k - 55)) (8 I{k - 61). (d.i) 

Then, the wheel is shifted (we increase k) and update the k position with the pre- 
viously computed sum 

I{k) = I{k-24:) +I{k-55). (d.2) 
The value of k is, of course, always taken modulo 62. 



^Actually, for optimisation purposes, two mixed lattices are formed, one containing the white 
nodes of replica 1 and the black ones of replica 2 and the other contauiuig the complementary sites. 
Then all the spins within the same mixed lattice can be updated at the same time. 
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Figure d.2: Board of the Janus computer. Each of the 16 scientific processors (SPs) is an 
FPGA. These are connected along nearest-neighbour links, with periodic boundary condi- 
tions. The board includes an input/ output processor. 




Figure 0.3: The complete Janus computer. 




Figure D.4: Left: One step of the Parisi-Rapuano generator, implemented through a logical 
circuit. Right: Nested generation of three random numbers at once with the same Parisi- 
Rapuano wheel. At the cost of complicating this circuit, Janus generates up to 96 random 
numbers per wheel per clock cycle. 



D.I — Performance 
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Conventional 

computers JANUS Experimental times 
I 1 1 1 1 H 1 1 1 1 

10"^ 10"^ 10""^ 10"^ 10"^ 10"^ 10° 10^ 10^ 10^ 

Time (seconds) 

Figure D.5: Comparison of the time scales accessible to Janus and to conventional com- 
puters. 



The interesting feature of this generator is that it can be easily implemented 
through a logic circuit (Figure |d.4| — left). More than that, at the cost of compli- 
cating the circuit, we can consider the simultaneous generation of many random 
numbers from the same wheel — several iterations of ( d. 1 ) and ( d.2 ) — by setting up 
a cascade-structured combinatorial logic. This is represented for the simultaneous 
generation of three random numbers in Figure D.4 — right. In the actual implemen- 



tation, Janus is capable of generating 96 random numbers for each wheel in each 
clock cycle (we still need to keep several wheels turning at the same time to keep 
up with the needs of the update engines). 



D.i 

Performance 

Janus is capable of simulating a system of size up to L = 96 in each FPGAj^ The 
above-outlined implementation is capable of exhausting 88% of the available logic 
resources. The clock frequency of the machine is of 62.5 MHz, so the performance 
of each FPGA is 

16 ns 1 cycle ^ ^ , . , . 

^ mo/ • ~ 16 ps/spm (D.3) 



1 cycle 1024 spins 

This number is, however, too abstract. One can update many spins per second by 
simply running many independent simulations at once, without accelerating any 
of them. The real measure of the performance of a computer (or an implementa- 
tion) is the new physics to which it grants us access. In this sense, in the above 
implementation Janus is capable of simulating 256 samples of an L = 80 Edwards- 
Anderson spin glass for 10^^ MCS in less than a month, a time long enough to 



understand what happens at the experimental scale, as we see in Chapter 10 (see 



Figure D.5 1. We also note that one can simulate even larger systems by sharing 
the workload among several FPGAs. For instance, a Janus board can simulate an 
L = 256 lattice for 10^*^ MCS in less than ten days. Notice that this means that 
fewer samples can be simulated at once, but this is of no consequence due to the 



^The FPGAs are Xilinx Virtex4-LX200. 
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self-averaging nature of the physical observablesjj 

Let us stress that Janus is optimised for accelerating the simulation of a single 
sample. If all one needs is to run many samples for a comparatively short time each 
— as is commonly the case for critical-point studies — Janus is also going to be very 
efficient, but there are simpler alternatives (the most popular being asynchronous 
multi-spin coding in CPUs or even graphic cards). In this dissertation, we con- 
sider also equilibrium simulation with Janus, but these are in the low-temperature 
regime where the thermalisation time for each sample is exceedingly long (cf. Ap- 
pendix |E]). 

We finally note that, aside from the simulations for the Edwards- Anderson spin 
glass with Ising spins reported in Appendix |E| Janus has been used to simulate 
the Potts spin glass (where the spins can take several values, not just ±1) jjANogcf 
IjANiocj . In this case, the gain factor with respect to conventional PCs is even 
greater. 



3At the time of running our non-equilibrium simulations, the code for the distributed simulation 
of an L = 256 system was still not ready, so we ran L = 80 systems instead. 



APPENDIX E 



Our spin-glass simulations: parameters and 
thermalisation 



We report in this Appendix the parameters of our spin-glass simulations and give 
some technical details on our logistics and (in the case of the ones in equilibrium) 
thermalisation tests. 



E.i 

Non-equilibrium simulations 

We have simulated the direct quench protocol explained in Chapter [9] for several 
temperatures. We have used heat bath dynamics, in the highly parallelised imple- 
mentation described in Appendix D The parameters of these runs are included in 
Table Ie.iI 

We have used lattices of linear size L = 80, simulating two real replicas for each 
sample. In a first simulation campaign, originally reported in |jano8c [, we ran one 
sample in each of our 256 FPGAs for just under one month, resulting in 96 samples 



L 


T 




-^samples 


80 


0.6 




96 


80 


0.7 


lOii 


63 


80 


0.7 




768 


80 


0.8 


lO^i 


96 


80 


1.1 


4.2 X 10'^ 


32 


40 


0.8 


2.2 X 10^ 


2218 



Table e.i: Parameters the non-equilibrium simulations studied in this dissertation. The 
overall wall-clock time needed was less than six weeks. 
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for T = 0.6,0.8, 63 samples for T = 0.7 and 32 for T = 1.1 ^ TcQ Our wall-clock 
time of 24 days was enough to reach 10^^ MCS for our subcritical simulations. At 
the critical point we only ran for 4.2 X 10*^ MCS, using the freed-up time to run 
some shorter simulations for smaller lattices. These were used exclusively to check 
for finite-size effects (cf. Section 10.3.2). 

We were interested in two kinds of dynamics: those at the critical temperature 
and those representative of the low-temperature phase. In the latter, one is inter- 
ested in a temperature low enough not to be dominated by critical effects. However, 
the lower the temperature, the slower the dynamics (where the speed is measured 
by the growth rate of the coherence length). Since one is interested in seeing as 
large a coherence length as possible, one should try to find a middle ground. After 
analysing the results of this first batch of runs in | |jano8c| , we decided that T = 0.7 
provided the best compromise. Therefore, we carried out a new set of simulations 
at this temperature, increasing the number of simulated samples by an order of 
magnitude. In these new runs, first reported in jjANogA] , we took only 10^*^ MCS, 
because after that time we could begin to resolve some finite-size effects in the co- 
herence length growth at that temperature. When giving non-equilibrium results 
at T = 0.7, we shall always refer to this larger set of runs, unless we say otherwise. 

We note that T = 0.7 is also the lowest temperature at which we could safely 
thermalise our largest samples in our equilibrium simulations (cf. Section E.2[ 
below). Therefore, this should be considered as our main working temperature 
and will, in particular, be the one we consider when establishing the quantitative 
statics-dynamics equivalence, one of our main results. 

Finally, let us give some logistical notes. Some of the physical quantities we 
study are time-consuming to measure and some are even functions of the config- 
urations at different times. Therefore, rather than taking complex measurements 
during the simulation (online) we saved to disk the spin configurations at log- 
arithmically spaced times. In particular, we stored all the information at times 
[2^/^] + [2}^^], with integer i and ; (the square brackets denote the integer part). 
This choice was made so that our t and t^ (cf. Chapter [9]) could be both studied in 
a logarithmic scale. 



E.2 

Equilibrium simulations 
E.2.1 Set-up of the simulations 

For our equilibrium simulations we used the parallel tempering algorithm de- 
scribed in Appendix |Aj In this case we simulated four real replicas for each sample 
(in order to have better statistics for some fine analyses, as well as for constructing 



^One of the FPGAs was faulty (it has since been replaced), so the total number of samples was 
actually 255. 
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unbiased estimators for some sophisticated observables). 

The parallel tempering was coupled to heat bath dynamics. The implementa- 
tion in this case was a little different than that explained for the non-equilibrium 
dynamics in Appendix |D| In particular, since we had to run many temperatures 
for each replica, each FPGA simulated just one replica, so each sample was spread 
across four computing cores. Due to the special architecture of Janus, the parallel 
tempering step is not costless, as was the case for the DAFR This is because the 
FPGAs first have to compute the total energy for each temperature. We thus equi- 
librate the computational cost of both updates by performing 10 heat bath steps 
for each parallel tempering update. This hardly affects the efficiency of the parallel 
tempering scheme. 

As was the case in our off-equilibrium dynamics simulations, most of the analy- 
ses were performed offline. In particular, for the shortest simulations (the samples 
that do not need extending after applying our thermalisation criteria) we stored on 
disk about ^ 100 evenly spaced configurations. This number grew proportionally 
to the length in the case of extended runs. Notice that this is in stark contrast with 
the DAFF, where we just saved the last configuration to act as a checkpoint for 
possible extensionsj^ 

In particular, we needed many configurations on disk to perform the analysis 



of iixed-q correlation functions, as we explain in Section E.2.3 



Finally, for a few specific samples (one for L = 24 and four for L = 32) the 



wall-clock time required to fulfil our thermalisation criteria (cf. Appendix A) was 
exceedingly long, more than six months. For these cases we used a special low-level 
code that spread the different temperatures for each replica across several FPGAs, 
thus speeding up the simulation (recall that Janus has very fast connections be- 
tween the FPGAs on the same board). 

For the smaller lattices (L < 12) we ran the simulations on Terminus, the com- 
puting cluster of the BIFI. Even if these were very small systems, we thermalised 
them down to extremely low temperatures, so the simulation time was far from 
negligible. 

E.2.2 Parameters of our simulations 

In our DAFF simulations, we set-up our parallel tempering scheme with a very high 
maximum temperature, where the copies of the system achieved decorrelation very 
fast. 

Here, however, we are not interested in the critical point, but rather in the 
low-temperature physics, which makes thermalisation all the more difficult. For 
that reason, the simulations are several orders of magnitude longer than in the 
DAFF's case and, hence, for the same ratio of autocorrelation time at Tmax over 
total simulation time we do not need to reach such high temperatures. Still, we 
can perform a quantitative analysis in order to justify our parameters (which can 
be seen in Table e.2[ ). 



^Actually, these configurations were also used for the geometrical study of Section 8.6 
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L 










J V TTT) 


riD 


riD 


A/; 


System 


8 


0.150 


1.575 


10 


10^ 


5.0x10*5 


8.30 xlO^ 


7.82 xlO^ 


4000 


PC 


8 


0.245 


1.575 


8 


10^ 


1.0x10^ 


6.48 X 10^ 


2.30x10^ 


4000 


PC 


12 


0.414 


1.575 


12 


5x10^ 


1.0x10^ 


1.53x10^° 


3.13x10^ 


4000 


PC 


16 


0.479 


1.575 


16 


10^ 


4.0 xlO^ 


2.79x10" 


9.71x10^ 


4000 


Janus 


24 


0.625 


1.600 


28 


10^ 


1.0 X 10'' 


1.81x10^2 


4.02 xlO'* 


4000 


Janus 


32 


0.703 


1.549 


34 


2x10^ 


4.0x10'^ 


7.68x10^1 


1.90x10^° 


1000 


Janus 


32 


0.985 


1.574 


24 


2x10^ 


1.0x10^ 


4.40 X 10'^ 


1.16x10^ 


1000 


Janus 



Table e.2: Parameters of our spin-glass parallel tempering simulations. In all cases we have 
simulated four independent real replicas per sample. The A/j temperatures are uniformly 
distributed between Tmin and Tmax (except for the runs of the first row, which have all 
the temperatures of the second one plus T = 0.150 and T = 0.340). In this table A/'mes is 
the number of Monte Carlo Steps between measurements (one MCS consists of 10 heat- 
bath updates and 1 parallel-tempering update). The simulation length was adapted to the 
thermalisation time of each sample, using the methods of Appendix |Aj The table shows 
the minimum, maximum and medium simulation times (Ahb) for each lattice, in heat-bath 
steps. Lattice sizes L = 8, 12 were simulated on conventional PCs, while sizes L = 16,24,32 
were simulated on Janus. Whenever we have two runs with different Tmm for the same L 
the sets of simulated samples are the same for both. The total spin updates for all lattice 
sizes sum 1.1 x lO^'^. It is interesting to compare the length of these simulations with the 
ones for the DAFF in Table 



Following Ogielski I0G185 1, the equilibrium autocorrelation time in the thermo- 



dynamic limit is taken from a power law to a critical divergence 

THB(T)-(T-Tc)-^^ (E.l) 

For instance, for the maximum temperature used in our largest lattice, Ogielski 
found Thb ^ 10^. This is several orders of magnitude shorter than our shortest 
simulations in Table 



E.2 



We chose the minimum temperature so that the whole simulation campaign 
took about 200 days of the whole Janus machine and so that the lowest temperature 
scaled as Tc - T^in ^ L'^^". With 4000 samples for L = 16,24 and 1000 for L = 32, 
this resulted in Tmm = 0.479,0.625,0.703, respectively. 

As we cautioned in Appendix A trying to optimise the choice of the remain- 
ing parameters (number and distribution of the intermediate temperatures) is an 
unrewarding task. We dedicated several weeks to testing several combinations, 
mainly by modifying the number A/V of temperatures so that the acceptance of the 
parallel tempering exchanges varied between 7% and 30%. Perhaps against con- 
ventional wisdom, taking into account that the computational effort at fixed A/hb 
is proportional to Mj, we found that the efficiency hardly changed. Eventually, we 
settled for a spacing of temperatures that produced acceptances of ^ 20%. This 
both avoided unconventionally low acceptances and saved disk space. 
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E.2.3 Offline evaluation of observables 

As we indicated before, most of our analyses were performed offline, on previously 
stored spin-glass configurations. In particular, the computation of conditional cor- 



relation functions Ci^{r\q) — Eq. 10.58 — was not only time-consuming but also re- 
quired a lot of unrefined data. This was a problem, due to the scarcity of stored 
configurations. In fact, for the samples that were simulated only for the minimum 
simulation time, we had only A/"conf ^ 100 configurations stored on disk (ranging 
from A/'conf = 10 for L = 12 to A^conf = 200 in the case L = 32). We regard the sec- 
ond half (in a Monte Carlo time sense) of these configurations as well thermalised. 
Yet, when forming the overlap field, one needs only that the two spin configura- 

tions, {Sx } and {s)( be thermalised and independent. Clearly enough, as long 
as the two configurations belong to different real replicas and belong to the second 
half of the Monte Carlo history they will be suitable. There is no need that the two 
configurations were obtained at the same Monte Carlo time (as it is done for the 
online analyses). Furthermore, the four real replicas offer us 6 pair combinations. 
Hence, we had at least 6 x (A/'conf/2)^ 10000 (60000 for L = 32) measurements 
to estimate the overlaps and the correlation functions. We used the Fast Fourier 
Transform to speed up the computation of the spatial correlations. For those sam- 
ples that had more configurations (because their total simulation time exceeded 
■^min)' considered nevertheless A/'conf/ 2 configurations evenly spaced along the 
full second half of the simulation. 

For some quantities, such as the spin overlaps, we did have a large number of 
online measurements. Therefore, When some quantity, for instance the p{ci), could 
be computed in either way, online or offline, we have compared them. The two 
ways turn out to be not only compatible, but also equivalent from the point of 
view of the statistical errors. As an example of this let us compute the following 
quantity: 

2 

f^lik = (Qfink) - (Qlink) • (e.2) 

For L = 32, T = 0.703, the value of (Jj^^^j, computed from online measurements of 
Qiink and Qj^j, is 

link,online 



NcrLk.online = 50.88(90). (E.3) 



We could now recompute this value from offline measurements of Qiink arid Q^^A^ 



Instead, we are going to use (11.321, which involves the intermediate step of com- 
puting conditional expectation values and variances at fixed q and then integrating 
with Tp{q). This will serve as a test both of the offline measurements' precision and 
of our Gaussian convolution method for the definition of fixed-g quantities. The 
result is 

^^^lUconf = 50.81(90), (E.4) 

The precision of d^^^T^ online conf same and the difference less than 10% 

of the error bar, even though we only analysed 100 configurations per sample for 
the second one. Of course, both determinations are very highly correlated, so the 
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Figure e.i: Histogram of exponential autocorrelation times for our simulations of the L = 
32 lattice (looo samples). 

uncertainty in their difference is actually much smaller than their individual errors. 
Computing the difference for each jackknife block we see that 

N[t^lUconf - ^Uonline] = -0-065(79), (E.5) 

which is indeed compatible with zero. 

A final issue is the comparison of data computed in different system sizes at the 
same temperatures. Unfortunately the grids of temperatures that we used for the 
different L differ. Hence we have interpolated our data by means of a cubic spline. 



E.2.4 Thermalisation 



We have followed the thermalisation protocol detailed in Section A.3.3 Figure 



E.l 



shows the histogram of exponential autocorrelation times for our L = 32 simula- 
tions. As with the DAFF, we need to use logarithmic bins. Notice the dramatic 
increase of the Tgxp when decreasing the minimum temperature of the simulation. 
In Figure e.2 we plot the logarithm of the histogram to show the exponential be- 



haviour of the long-times tail. This result gives confidence that rare events with 
extremely large correlation times are at least exponentially suppressed. 



E.24.1 Thermalisation tests 

We consider in this section thermalisation tests directly based on physically mean- 
ingful quantities. 



We start by the traditional log2-binning procedure of Section A.3 illustrated in 
Figure E.3 for the Binder ratio of Eq. ( ii.i[ >, which is especially sensitive to rare 
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Figure e.2: Logarithm of the histogram of exponential autocorrelation times for our sim- 
ulations of the L = 24 lattice (4000 samples). Mind the behaviour of the long-times tail. 
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Figure E.3: Evolution of the Binder parameter for L = 32, T = 0.703 using log2 binning. 
The blue curve (circles) is the result of stopping at step 1 of our thermalisation protocol 
(i.e., all samples simulated for a fixed time of 4x10^ heat-bath updates). The red curve 
(squares) is the result of completing all the steps, which implies an increase of roughly 
150% in simulation time. 
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Figure E.4: Binder ratio as a function of the temperature for L = 32. The good overlap 
between two different simulations (one of them in the much easier critical region) is a 
further thermalisation check. We use the same set of 1000 samples. 



events and, therefore, to possible thermalisation biases. In the figure we show two 
curves, both for L = 32 and T = T^in = 0.703. The blue one, with circles, is 



the result of stopping at step one in our thermalisation protocol of Section A.3.3 



and shows a poorly thermalised ensemble. The situation improves dramatically if 
we follow the simulation protocol to the end, simulating each sample for a time 
proportional to its autocorrelation time. Notice that, thanks to our choice of M^^, 
the simulation time for most samples has not increased. If we first rescale the data 
according to total simulation length and average for equal rescaled time, the log2- 
binning gives four steps of stability within errors. That is to say: we obtain the 
Binder parameter without thermalisation bias just discarding 1/16 of the history. 
Regarding B, then, our requirement of 12Texp has been excessive. 

In retrospect, shorter simulations would have produced indistinguishable phys- 
ical results for most observables. We do not regret our choices, however, as we 
plan to use these thermalised configurations in the future [jjANXxjl for very delicate 
analyses (such as temperature chaos), which are probably much more sensitive to 
thermalisation effects. 

A different test can be performed by comparing the difficult low-temperature 
simulations of our largest lattice with simulations of the same samples in the much 
easier critical region. A faulty thermalisation (for instance, a configuration remains 
trapped at low temperatures) could be observable as inconsistencies in the values 
of quantities in common temperatures. In Figure E.4 we show the Binder parameter 



as a function of temperature for the two simulations with L = 32 (see Table E.2I. 
The agreement between both simulations is excellent. 

We carefully avoided making decisions during thermalisation based on the val- 
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0.2 0.4 0.6 1.766 1.772 1.778 

Pi\a\ < Q -(u> 

Figure E.5: Left: Scatter plot of the exponential autocorrelation time against the probability 
that the overlap be less than a small quantity at T = 0.703, for all our L = 32 samples. 
Right: Scatter plot of the autocorrelation time and the energy at T = 0.703 for the same 
samples. The thermalisation time does not seem to be correlated with these physically 
relevant quantities. 



ues of physical quantities. However, one could worry about the possibility of im- 
portant statistical correlations between the temperature random walk and interest- 
ing quantities. Such correlation could originate some small biases that would be 
difficult to eliminate. Fortunately, we have not found any correlation of this type. 
In Figure E.5 we show the correlation between Tgxp and two important quantities: 
probability of the overlap being small and the energy. 
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